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This text aims to give a pedagogical introduction into the main 
concepts of the theory of structure formation in the universe. The 
text is suited for graduate students of astronomy with a moderate 
background in general relativity. A special focus is laid on deriv- 
ing the results formally from first principles. In the first chapter 
we introduce the homogeneous and isotropic universe defining the 
framework for the theory of structure formation, which is dis- 
cussed in the three following chapters. In the second chapter we 
describe the theory in the Newtonian framework and in the third 
chapter for the general relativistic case. The final chapter dis- 
cusses the generation of perturbations in the very early universe 
for the simplest models of inflation. 
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Preface 



This text aims to give a pedagogical introduction into the main concepts of the theory of structure 
formation in the universe. The text is suited for graduate students of astronomy with a moderate 
background in general relativity. A special focus is laid on deriving the results formally from first 
principles. 

During my PhD studies on high redshift galaxy groups I had the endeavor to understand the 
theoretical framework on which my research was based. I did not only want to be able to reproduce 
statements that were made in the books, but to understand where they came from and what their 
underlying assumptions were. Therefore I read several books on cosmology and worked out the basic 
theory the way I found it the most accessible. In the course of time I filled a couple of notebooks 
this way, until I finally started to convert part of them into digital form as an introduction for my 
PhD thesis. But, as the introduction turned out to be way too long, I eventually did not include 
it into my thesis. However, since I had already spent much work compiling this text and since I 
was encouraged by the positive feedback of students who had read it, I decided to assemble it into a 
self-contained introduction on cosmological structure formation in order to make it also available to 
other graduate students with the same interest as me. 

I attempted to make a common theme obvious, which is the question of how structures in the 
universe were created and grew to the present time large-scale structure of galaxies that is observable 
today. The selected material is supposed to be self-contained, but nevertheless concise. I put a special 
focus on clarifying how the results are formally derived from underlying fundamental principles and 
which assumptions were made. This still does not imply every single calculation to be included in 
detail. For instance, the derivation of the Robertson- Walker metric is not performed specifically, 
as "maximally symmetric spaces" are a special part of general relativity rather than astrophysical 
cosmology. However, I show certain conditions to be satisfied so that I can refer to a common theorem 
of general relativity in the literature that uniquely leads to the Robertson-Walker metric. Besides it 
is generally only the simplest possible case worked out in a given context, since these cases often allow 
elegant, rigorous derivations. This approach is motivated by the fact that the simplest cases already 
allow a sufficient qualitative understanding and that realistic quantitative results in the context of 
structure formation usually require large numerical computations. 

Unfortunately, technical texts with a high aspiration for completeness and rigorousness are prone 
to become longish, while concise texts have the tendency to be incomplete, inaccurate, or ambiguous. 
For this reason this introduction contains unusually many footnotes compared to other astronomical 
texts. In order to keep the central theme as straight and concise as possible, I included many minor 
comments and sometimes also short derivations in the form of footnotes. That is the basic text 
(without footnotes) is self-contained, while the footnotes provide additional comments and assistance. 
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It is exactly these footnotes, which can be helpful for students to understand certain subtleties. Those 
readers who are not interested in these details can just omit them whithout losing the thread. 

There are still many topics that would fit into the theme of this introduction, which were omitted, 
such as the derivation of the cosmic microwave background fluctuations or a more detailed analysis 
of the nonlinear regime of the large-scale structure. So naturally the material selected represents 
only a tiny fraction of interesting and important topics in the context of structure formation. 

Since I aim to maintain this text, all sorts of comments are welcome. If you find typos or if you 
have suggestions on how to improve the content, I would be grateful, if you sent me a message to 
knobelc@phys.ethz.ch, so that I can update it. 

Structure of the introduction 

This introduction is divided into four chapters. The first two are easier to understand than the last 
two and need only little input from general relativity (only for the derivation of the Robertson- Walker 
metric and the Friedmann equation). The second chapter is even fully based on Newtonian physics 
and yet contains most of the results that are presented. On the other hand, the last two chapters 
and the appendix are entirely based on general relativity and are much more technical than the first 
two chapters. They require a moderate background in general relativity, although we still derive all 
results starting with the field equations. Readers who are not interested in the theory of general 
relativistic structure formation can just omit the latter two chapters and focus on the former ones, 
which are mostly self-contained. 

In Chapter 1 we discuss the universe in the homogeneous and isotropic limit. Starting with 
the Robertson- Walker metric, we derive the Friedmann equation and describe the dynamics of the 
universe for given energy contents. We introduce many basic concepts that will be needed in further 
chapters, such as redshift, comoving distance, and horizons. Then we give an overview of the current 
concordance cosmology (i.e. the ACDM model) and briefly summarize the history of the universe. 
Finally, we provide an introduction into the phenomenology of the simplest models of inflation. 

In Chapter 2 we present the theory of structure formation based on Newtonian physics which is 
valid well inside the horizon. Interestingly, using Newtonian physics it is possible to derive basically 
all of the main results for the formation of structures in the universe (except of the form of the 
primordial power spectrum) and this is the reason why many books entirely omit a treatment of the 
general relativistic case. We first derive the equations governing the growth of fluctuations at first 
(linear) order within an expanding universe from the basic hydrodynamical equations and discuss 
the different possible fluctuation modes. We afterwards introduce the correlation function and the 
power spectrum to describe the perturbations in statistical terms. In a further step, we leave the 
linear regime and describe the formation of nonlinear bound structures ("halos") by means of the 
(simplistic) "spherical top hat model" . We motivate approximative formulas for the number density 
and spatial correlation of dark matter halos in the universe. Finally, we introduce the "halo model" , 
which is the current scheme for analyzing the clustering of galaxies in the universe. 

In Chapter 3 we give an introduction into the linear theory of hydrodynamic perturbations in 
the general relativistic regime, which is needed to understand the evolution of structures outside 
the horizon. Without this theory, it is impossible to understand how structures that were created 
during inflation evolved until present time. Compared with the Newtonian linear theory, the general 
relativistic case is much more complicated and also allows perturbations which have no Newtonian 
counterpart (e.g. gravitational waves). We first introduce the Scalar- Vector- Tensor decomposition to 
simplify the perturbed field equations, since they decouple into independent scalar, vector, and tensor 
equations. Next, we further simplify the field equations by introducing "gauge transformations" and 
choosing particular gauges. We compare our results to the Newtonian ones from the previous chapter 
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for the limiting case well inside the horizon and give a sketch for the general treatment which treats 
the fluctuations by means of the general relativistic Boltzmann equation. 

Chapter 4 is the most technical chapter of all and basically consists of one single big calculation. 
The aim is to compute the form of the primordial dark matter power spectrum that results from 
the generation of fluctuations by the simplest models of inflation. The chapter is divided up into 
three parts: First, we quantize the perturbations of a scalar field inside the horizon during inflation 
and compute the power spectrum of the fluctuations in the ground state, second we show that under 
certain conditions the perturbations remain constant outside the horizon, and finally we compute the 
spectrum of the perturbations after they have reentered the horizon during the matter dominated 
era and compute the deviations from scale invariance that are expected from slow-roll inflation. 

The appendix gives an introduction into the theory of a classical scalar field in the context of 
general relativity. We derive the equations of motion for the scalar field in a smooth Friedmann- 
Robertson- Walker universe and then for the general relativistic linear theory of perturbations. 

Some key terms that are frequently used are abbreviated: dark matter (DM), large-scale structure 
(LSS), Friedmann-Lemaitre- Robertson- Walker (FLRW), cosmic microwave background (CMB), and 
A cold dark matter (ACDM). 
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Homogeneous and isotropic universe 



Astrophysical cosmology needs a theoretical framework that allows the interpretation of observational 
data. Without such a framework not even the most basic observational properties of galaxies, such as 
redshift, apparent luminosity or apparent size, could be interpreted properly. The current theoretical 
framework accepted by most astronomers is the "concordance model", which is a special case of 
a Friedmann-Lemaitre-Robertson- Walker (FLRW) world model. These models are based on the 
assumption that the universe is governed by general relativity and are essentially homogeneous and 
isotropic, if smoothed over large enough scales. 

In this chapter we introduce the FLRW models and how observational data is interpreted within 
them. It builds the basis for all other chapters. In Section 1.1 we briefly discuss the philosophical 
assumptions behind the FLRW models and in the Sections 1.2 and 1.3 we develop the mathematical 
formulation of the FLRW models. In Section 1.4 we introduce redshift, peculiar velocities and discuss 
the structure of causality within the FLRW world models. Then in Section 1.5 we restrict the FLRW 
world models to the current concordance cosmology and in Section 1.6 we discuss some particularities 
of the concordance model and what they might tell us about the very early universe. 

1.1. The cosmological principle 

Modern cosmology is based on two fundamental assumptions: First, the dominant interaction on 
cosmological scales is gravity, and second, the cosmological principle is a good approximation to the 
universe. The cosmological principle states that the universe, smoothed over large enough scales, 
is essentially homogeneous and isotropic. "Homogeneity" has the intuitive meaning that at a given 
time the universe looks the same everywhere, and "isotropy" refers to the fact that for any observer 
moving with the local matter the universe looks (locally) the same in all directions. The precise 
formulation and the consequences of these two concepts in the context of general relativity will be 
discussed in the next section. But first we want to explore a bit more the philosophical issues of the 
cosmological principle 1 . 

How can the cosmological principle be justified? Obviously, the universe is not homogeneous and 
isotropic on scales as big as our Solar System, our Galaxy or even our Local Group of galaxies. Nev- 
ertheless the cosmological principle has been invoked from the beginning of modern cosmology in the 
first half of the 20th century, when almost nothing about the large-scale structure in the universe was 



1 Ellis (2006) provides a systematic discussion of philosophical issues for cosmology, which can be warmly recom- 
mended. 
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known. The main reasons for its acceptance were simplicity and the "Copernican principle" . Apply- 
ing the cosmological principle to general relativity yields rather strong constraints and leads to the 
simplest category of realistic cosmological models. 2 On the other hand, the Copernican principle 
according to which we do not occupy any special place in the universe fits the cosmological principle 
perfectly (Ellis 2006, Sect. 4.2.2). If we perceive the universe around us isotropically, the Copernican 
principle asserts that also other observers should see the universe isotropically, since otherwise we 
would occupy a special place in the universe. Since a universe that is isotropic everywhere is also 
homogeneous (in fact, isotropy around three distinct observers suffices), the cosmological principle is 
a relatively straightforward conclusion from an observed isotropy and the Copernican principle. 

Over the last two decades, the amount of data in astronomy has grown immensely so that today 
the cosmological principle can be discussed in the context of a wealth of different and detailed 
observations, even though only consistency statements are possible. 3 For instance, the isotropy of 
the universe with respect to the Milky Way has been strongly confirmed by the remarkable isotropy 
of the cosmic microwave background (CMB, see Sect. 1.5.2) as observed by the satellites COBE and 
WMAP. If the dipole 4 in the CMB is interpreted as relative motion of the Earth with respect to 
the CMB rest frame, the degree of isotropy is as high as 10~ 5 (Smoot et al. 1992). On the other 
hand, huge low-redshift galaxy surveys such as the 2-degree field galaxy redshift survey (2dfGRS, 
Colless et al. 2001) and the Sloan digital sky survey (SDSS, York et al. 2000) have convinced most 
cosmologists that not only isotropy but also homogeneity is in fact a reasonable assumption for the 
universe. Taking a glance at light cones produced by these surveys (see Figure 1.1) one can see 
(even without any statistical tools) that the fractal nature of the universe stops at a certain scale and 
builds a net of clusters and filaments which is called large scale structure (LSS). However, it is still 
difficult to exactly estimate the scale on which the universe becomes homogeneous. Hogg et al. (2005) 
investigated the spatial distribution of a big sample of luminous red galaxies (LRG) from SDSS and 
found that after applying a smoothing scale of about 100 Mpc, the spatial distribution approaches 
homogeneity within a few percent. In a more recent study, Scrimgeour et al. (2012) demonstrated 
by selecting 200,000 blue galaxies within an unprecedented huge volume that a fractal distribution 
of galaxies on scales from about 100 Mpc to 400 Mpc can be excluded with high confidence. 



2 For instance, Weinberg (1972, p. 408) expresses this spirit by writing: 

The real reason, though, for our adherence here to the Cosmological Principle is not that it is surely 
correct, but rather, that it allows us to make use of the extremely limited data provided to cosmology 
by observational astronomy. If we make any weaker assumptions, as in the anisotropic or hierarchical 
models, then the metric would contain so many undetermined functions (whether or not we use the field 
equations) that the data would be hopelessly inadequate to determine the metric. On the other hand, by 
adopting the rather restrictive mathematical framework described in this chapter, we have a real chance 
of confronting theory with observation. If the data will not fit into this framework, we shall be able to 
conclude that either the Cosmological Principle or the Principle of Equivalence is wrong. Nothing could 
be more interesting. 

For a historical account of the beginning of modern cosmology we refer to Nussbaumer & Bieri (2009). 

Consistency is generally very important in cosmology as it is often the only way to "test" a paradigm. Since the 
conversion of astronomical observations such as redshifts, apparent luminosities and apparent sizes into distances, 
absolute luminosities and physical sizes depend on the adopted cosmological framework, also our reconstruction of 
the universe depends on cosmology. This is why only consistency statements within a given framework are possible. 
In principle there could be several cosmological frameworks based on different assumptions leading to consistent 
interpretations of the observations. 

4 The dipole leads to a relative motion of the center of the Milky Way with respect to the rest frame of the CMB 
of about 552 km/s (Kogut et al. 1993). This is comparable to the peculiar velocities of other galaxies which are 
typically in the range of a few hundred km/s depending on the cosmic environment, thus interpreting the dipole as 
due to the relative motion of the Milky Way is consistent within the concordance model. 
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Figure 1.1. The large-scale structure (LSS) as observed with the largest current, spectroscopic 
galaxy surveys. The upper panel displays two cones of the 2dfGRS and the lower panel two 
cones of SDSS. Every point in the figure corresponds to a galaxy. For the SDSS cones, the 
galaxies are colored according to the ages of their stars, where red corresponds to older stellar 
populations. The structures are shown out to redshift z ~ 0.2 which corresponds to a light 
travel time of about 2.6 Gyr and a comoving distance of about 800 Mpc for the concordance 
cosmology (see Sects. 1.4 and 1.5). For both surveys the flux limit becomes apparent at this 
redshift. Obviously, the LSS is made up of sheets and filaments of galaxies, which can be as 
big as 100 Mpc. It should be noted that only the LSS of the luminous matter is visible on such 
diagrams. (Credits: 2dfGRS team, and M. R. Blanton and the SDSS team) 
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Despite these observational confirmations, the cosmological principle remains a fundamental as- 
sumption (Ellis 2006, Sect. 4.2.2), and moreover we can hardly make any reasonable statement about 
the state of the universe far beyond our current horizon of causality (cf. Sect. 1.6.4, where the cosmo- 
logical principle is revisited in the context of inflation). But even regarding the universe within our 
horizon, there are still some cosmologists sharing doubts about the validity of the cosmological prin- 
ciple or at least exploring other possibilities. Doubts are mainly raised by the apparent acceleration 
of the universe as observed by type la supernovae, which is accounted for by invoking dark energy 
(cf. Sect. 1.5). It has been claimed that this acceleration could just be an artefact caused by inho- 
mogeneities in the universe due to the nonlinearity of Einstein's field equations without any actual 
acceleration taking place. This effect is called backreaction (see Clarkson et al. 2011 for a review). 
Although it was not yet possible to rule out that the interpretations of our observations are to some 
extent affected by backreaction, there are good arguments why it should be negligible at least on 
cosmological scales. However, the best argument for the validity of the cosmological principle is pre- 
sumably the remarkable consistency of several independent observables, such as CMB anisotropics, 
galaxy power spectra, type la supernovae, cluster abundances and others (see e.g. Dunkley et al. 
2009, Sect. 4.2), within the framework of the concordance model. 

Throughout this introduction we will assume that the metric and the dynamics of the universe are 
well described to zeroth order by the smoothed homogeneous and isotropic universe, and that the 
observed inhomogeneities in the universe can be treated as perturbations within the homogeneous 
and isotropic background. 

1.2. The Robertson-Walker metric 

In this section we discuss the metric of the universe as required by the cosmological principle. This 
metric determines all geometrical properties of the universe, such as the distance between two points 
or the apparent extension of an object with known diameter if seen at a given distance. In a first 
step we will present the mathematical formulation and in a second step its physical interpretation. 

1.2.1. Mathematical formulation 

According to the first fundamental assumption of the previous section, cosmology is based on general 
relativity being the best theory of gravity so far. So we regard spacetime as a pseudo-Riemannian 
manifold M with metric g^ u , where the latter is determined by the Einstein field equations. Then, the 
second fundamental assumption (i.e. the cosmological principle) states that the universe is essentially 
homogeneous and isotropic. In order to apply these properties to the manifold Ai, we have to carefully 
paraphrase the intuitive notion of these terms in the context of the general relativistic spacetime. 
Hereto we mainly follow the outline of Wald (1984, Ch. 5) and Misner (1973, Sect. 27.3). 

"Homogeneity" refers to the intuitive notion that at a given time t spacetime looks the same at any 
place. However, in general relativity there is no "absolute simultaneity", since whether two events 
happen at the same time depends not only on the chosen reference frame, but also on the metric Q^u' 
"At a given time" means in general relativity "on a given spacelike hypersurface" . So homogeneity 
is interpreted such that the whole manifold M. can be sliced up into a one-parameter family of 
spacelike hypersurfaces (slices) S< (see Fig. 1.2), which are homogeneous. That is for any time t 
and for any two points p,q G S t , there exists a diffeomorphism (i.e. a coordinate transformation) of 
spacetime that carries p into q and leaves the metric g^ u invariant. 

To introduce the concept of "isotropy" we first note that an isotropic universe will not appear 
isotropic to any observer. For instance, if the universe appears isotropic to an observer at rest within 
the Milky way, then it would not appear so to an observer moving away from the Milky way at half 
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Figure 1.2. A schematic illustration of the manifold Ai. Shown is the world line of a 
fundamental observer O that pierces through the spatial slices E^, Tt 2 , and Tt 3 of constant 
time ti, t2, and £3, respectively. The tangent vector of the line O in the point p is denoted 
by u^, whereas is a spatial vector perpendicular to it. Since a fundamental observer is free 
falling, the metric in the restframe of the observer is locally Minkowskian (i.e. the observer 
does not feel any gravity) and thus diagonal. That is, the vector must be perpendicular 
to its local slice of simultaneity (dashed line). If the slices are homogeneous, the fundamental 
observer will intersect them perpendicularly. 



the speed of light. Such an observer would detect light coming toward him with much higher intensity 
than from behind. So, to study isotropy we have to consider the "world lines" of observers. Let be 
the tangent vector along the world line in a point p and v% and v% any two unit vectors perpendicular 
to it (see Fig. 1.2). Isotropy then means that there exists a diffeomorphism of spacetime with fixed 
p and that carries v± into v% and leaves the metric g^ v invariant. An observer that sees the 
universe (locally) isotropic at any time is called a "fundamental observer" . 

As mentioned in the previous section, a fundamental observer must be moving with his local matter 
(otherwise the local flow of matter would indicate a preferred direction). Moreover, such an observer 
must also be free falling (otherwise the gravitational force acting on the observer would introduce 
a preferred direction) and thus the metric in its restframe is locally Minkowskian (i.e. the observer 
does not feel any gravity). That is, we can choose coordinates in the restframe of the observer such 
that the metric takes the familiar form g^ u = diag(— 1, 1, 1, 1) at any point along the world line. 
Since for an observer at rest the tangent vector along the world line points in the direction of the 
time coordinate and since the metric is diagonal, the tangent vector is always perpendicular to 
the observer's local slice of simultaneity (see Fig. 1.2). 

Invoking both, homogeneity and isotropy, is already more than needed as mentioned in the previous 
section. It is sufficient to assume the existence of a single fundamental observer in addition to 
homogeneity to fully determine the metric g^ v as long as this observer crosses every slice Tt. Then it 
is easy to see that the world line of the fundamental observer always pierces perpendicular through 
the homogeneous slices Tt, i.e. is perpendicular to Tt for any t. (Geometrically speaking, the slice 
of local simultaneity perpendicular to in the point p is tangential in p to the slice Tt going through 
p.) If u M was not perpendicular to Tt, the observer would be in motion relative to the homogeneous 
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slice and would therefore observe a preferred direction in the universe. 5 Due to this perpendicularity 
of the fundamental observer to the slice St, the isotropy condition fully applies to the slice St. That 
is, if we regard the slice St, which is a submanifold of A4, as an independent manifold, the isotropy 
around our fundamental observer implies the following: For any two tangential vectors v± and v% of St 
in p there exists a diffeomorphism on Sf which carries into v%, while leaving p and g^ v (restricted 
on St) invariant. This is exactly the definition of an arbitrary manifold being isotropic around a 
point p. Thus any slice St is not only homogeneous, but also isotropic around the point where 
the fundamental observer intersects it. Since for any manifold homogeneity and isotropy around a 
point entail maximal symmetry (e.g. Weinberg 1972, Sect. 13.1), any slice St constitutes a three 
dimensional maximally symmetric space, i.e. a three dimensional space with constant curvature. 

A spacetimes that is made up of maximally symmetric spatial slices has almost no remaining 
degrees of freedom. It can be shown (e.g. Weinberg 1972, Sect. 13.5) that for such a spacetime there 
always exist coordinates x = (x°, x 1 , x 2 , x 3 ) = (ct, x, 9, f) such that the metric g^ v takes the form of 
the Robertson- Walker metric whose line element is 

ds 2 = g„ v {x) dx^dx v = -c 2 dt 2 + R 2 (t) 7ij -(x, 9, if) dx i dx j (1.1) 

with 7y being the metric of a three dimensional space of constant curvature, which is generally 
described by 6 

d-v 2 

7ij ( X , 9, 99) dx*dx> = 1 _ K 2 + X 2 (d9 2 + sm 2 (9) dip 2 ) . (1.2) 

By adjusting the coordinate %■> the constant K can always be normalized to one of the three discrete 
values 1, 0, or —1 specifying the geometry of the slice, where K = 1 corresponds to positvely curved, 
K = to flat, and K = —1 to negatively curved space. In Eq. (1.1), t is called cosmic time 
(or epoch), R(t) is the cosmological world radius, and (x, </?) are spatial spherical comoving 
coordinates for reasons that will become clear in the next section. While R(t) takes units of length, 
the comoving coordinates (x, 9, ip) are dimensionless. The ranges of values for the coordinates are 7 

0<X<{ ~' kZo' 1 0<9<tt, 0<^<2tt, (1.3) 

5 Here, we assumed that the homogeneous slices E t are unique, i.e. for a given point p there is exactly one homogeneous 
slice that passes through p. There are, however, cases (e.g. Minkowski spacetime, de Sitter spacetime) for which 
there is not a unique way how spacetime can be split into homogeneous slices E t . Nevertheless, for these cases we 
can always find a corresponding family of homogeneous slices which are perpendicular to the fundamental observer 
(see Wald 1984, Ch. 5). 

6 The generality of this expression is guaranteed by the uniqueness theorem for maximally symmetric spaces. Two 
maximally symmetric spaces with the same curvature and the same metric signature are always isometric to each 
other (see Weinberg 1972, Sect. 13.2). Since K is the curvature of 7^ and can take any value, we can for a given 
maximally symetric space (with the right metric signature) choose coordinates, so that the metric takes the form 

Of fij. 

7 We have not yet said anything about the mathematical topology of our spacetime M. With the ranges of coordinates 
in Eq. (1.3) the spatial slices are topologically homeomorph to the 3-sphere S 3 in the case K — 1, to the Euclidian 
space R 3 in the case K = 0, and to the hyperbolic 3-space H 3 in the case K = — 1. These topological spaces are 
all simply connected, that is any closed line on these slices can be continuously contracted to a point, and the 
volumes of the universe in the cases K = 0, 1 are infinite. However, general relativity is a local theory, i.e. the 
metric determines the local properties of spacetime, but not its global structure. There are also multi-connected 
topological spaces consistent with our maximal symmetric slices and for these topologies the allowed parameter 
range is smaller than the one admitted in Eq. (1.3). Some of these topologies even allow finite volumes for the 
universe in the cases K = 0, 1. As different topologies can lead to different observational results and can also 
affect the growth of structure in the universe, they have to be considered as possible models for the universe. A 
systematic and pedagogical introduction into this topic is given by Lachieze-Rey & Luminet (1995). Since the 
standard simply-connected topologies are so far consistent with all measurements, we will stick to this case for 
simplicity. 
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where 9 and <p are the standard angle coordinates on the sphere, and x is a sor t of radial coordinate. 
A detailed discussion of the physical interpretation of the Robertson- Walker-metric will be given in 
the next section. 

Although Eq. (1.1) is already the general Robertson- Walker metric, it is often convenient to use 
slightly different forms of it. If we perform the substitution x = fx(r), where /^(f) is defined by 



Mr) 



the Eqs. (1.1) and (1.2) together become 




if K = 1 
if K = 
if K = -1 



(1.4) 



ds 2 = -c 2 dt 2 + R 2 {t) \dr 2 + /|(f) (d9 2 + s\n 2 (9)dip 2 ) 



(1.5) 



The new coordinate r is still dimensionless, but it is now proportional to the physical distance from 
the coordinate origin as we shall see in the next section. Note that for K = 1, the spatial part of 
Eq. (1.5) takes the standard form of the 3-sphere with radius R(t), where f only takes values in 
the range < f < tt and just plays the role of another angle coordinate in addition to 9 and (p. 
The Robertson- Walker metric in the form of Eq. (1.5) is very compact and allows a straightforward 
interpretation of the comoving coordinates (r,9,(j)). However, there is still another form which is 
more common among cosmologists even if slightly less compact. To derive it, we introduce the 
dimensionless scale factor 

R(t) 



a(t) = 



(1.6) 



R(to) ' 

where to denotes an arbitrary reference epoch that is usually chosen to be the present time. With 
the coordinate transformation r = R(to)r, Eq. (1.5) becomes 



ds 2 = -c 2 dt 2 + a 2 (t) dr 2 + R 2 f 2 K {r/R ) (d9 2 + sin 2 (0) dtp 2 ) 



(1.7) 



where Rq = R(to). Here a(t), 9, and (p are dimensionless, and r takes units of length. This version of 
the Robertson- Walker metric has the advantage that it holds a(to) = 1, and the comoving coordinates 
(r, 9, <f>) are the usual spherical coordinates taking physical units. Eq. (1.7) is the form of the metric 
we will mainly work with. 

Sometimes it is convenient to work with another time coordinate. So we introduce the conformal 
time r by setting 



dr = 



a(t) 



dt . 



Using conformal time r instead of cosmic time t the scale factor moves in front of the total metric 



ds 2 = a 2 {r) \ - dr 2 + dr 2 + R 2 Q f 2 K {r/R G ) (d9 2 + sm 2 (9)d<p 2 ) 



(1.9) 



The main advantage of using conformal time is that it allows analytic solutions to the time evolution of 
open and closed universes (cf. Sect. 2.3.1), and the metric undergoes just a conformal transformation 
as r changes. 
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1.2.2. Physical interpretation 

It is relatively easy to see that the fundamental observers are the observers at fixed comoving co- 
ordinates, i.e. those at rest to the homogeneous slices. For instance, the observer at x = is a 
fundamental observer, since he sees an entirely isotropic universe at any time due to the rotational 
symmetry of 7^. But this also holds for any other point with fixed comoving coordinates, since 7^ 
is maximally symmetric and so any point with fixed comoving coordinates could have been chosen 
as the spatial origin. We will term the fundamental observers also comoving observers. 

How are the comoving observers related to the matter (i.e. galaxies) in the universe? As already 
mentioned in the previous section, the fundamental observers must be at rest relative to local flow of 
matter. So, as long as the symmetry of the universe is perfect, all matter will stay at fixed comoving 
coordinates. All matter is of course also free falling. This can be formally shown by means of the 
geodesic equation. Since for the Robertson- Walker metric Tq = 0, comoving observers satisfy the 
geodesic equation 

d 2 x v dx» dx a 

where r pr is the proper time for the comoving observers, which are thus free falling. Moreover, for 
comoving observers, the line element (1.7) reduces to ds 2 = —c 2 dt 2 . Since they follow timelike world 
lines (like every physical observer), their lapse of proper time dr pT is related to the line element by 
c 2 dr 2 r = —ds 2 and we get the simple relation between cosmic time t and proper time r pr for comoving 
observers 

dt = dr P r . (1.11) 

This means that for comoving observers (and thus for galaxies) the cosmic time t in the Robertson- 
Walker metric is just their proper time r pr as given by a standard clock in their rest frame. This also 
means that if we synchronize a set of comoving observers on a slice of constant time, they will stay 
synchronized on every subsequent slice as time goes on. 

The physical distance (or proper distance) d pi (t) between two points x\ and X2 on the slice 
t is defined by the physical length of the shortest connection on the slice between them. While it 
might be complicated to find and parametrize this connection for two arbitrary points X\ and X2, 
we can simplify this problem substantially by taking advantage of the underlying symmetry of the 
Robertson- Walker metric and choosing coordinates such that x\ = (homogeneity) and x-i = (r, 0, 0) 
(isotropy). With the parametrization x{\) = (A,0,0) 8 , the physical distance d pr {t) is obtained by 



r \ dx 1 dxi r 

d pr (t) = a(t) J ^.(A,0,0) — — d\ = a(t) J d\ = a(t)r. (1.12) 

This confirms the interpretation of the coordinate r as the standard radial coordinate (up to the scale 
factor). Moreover, Eq. (1.12) tells us that the physical distance of any two comoving observer scales 
with the time dependent scale factor a(t). Assuming a(t) was a monotonically increasing function 
of t as indicated by the redshift of galaxies (see Section 1.4.1), it follows that any two galaxies in 
the universe are receding from each other and all separations between galaxies increase by the same 
factor with time. This global, coherent motion is called Hubble flow and describes the expansion 
of the universe. 9 It is often convenient to measure distances irrespective of the expansion of the 



Note that for K = 1 there are always (at least) two possibilities being the great circle segments between the two 
points. In this case we take the shorter one. 
9 It is important to note that the expansion of the universe not only means that galaxies are receding from each other, 
but rather that the universe as a whole is growing. For instance, in the case of K — 1 the proper volume of the 
universe is given by V = 2n 3 R :3 (to)a 3 (t), thus it is finite and grows proportionally to a 3 (t). 
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universe. So we define the comoving distance d(t) as 




(1.13) 



Note that the comoving distance between comoving observers is constant and equal to the proper 
distance at the time to. 



1.3. The Friedmann equations 

In a homogeneous and isotropic universe, the dynamics of spacetime and matter are determined 
solely by the scale factor a(t). In order to determine the scale factor, we have to know the content 
of the universe in form of the energy momentum tensor and solve the Einstein field equations. 

1.3.1. Field equations and equation of motion 

Fortunately, the symmetries of the universe not only set strong constraints on the metric g^u, but also 
on the energy momentum tensor. Since spatial coordinate transformations affect only the i = 1,2,3 
components of T^, it follows immediately that Too transforms like a 3-scalar, To like a 3-vector, and 
Tij like a 3-tensor under such transformations. 10 

Moreover, since has the same transformation behavior as the metric tensor g^ u and the latter 
is substantially restricted by the symmetry of the spatial slice as shown in the previous section, the 
same restrictions hold for TJ^,. It can be proven (see Weinberg 1972, Sect. 13.4) that Too, To, and 
Tij must take the form 

T o = p(t)c 2 , T = 0, T ij =p(t)g ij , (1.16) 

where the functions pit) and p(t) can depend only on t. However, this means nothing else than that 
the energy momentum tensor takes automatically the form of an ideal fluid 



(1.17) 



where the function p(t) gets the interpretation of the matter density, p(t) that of the pressure 11 , and 
where = — u M = (c, 0, 0, 0) is the 4- velocity of the fluid in comoving coordinates (which is vanishing 



I The general transformation behavior 

W = T^)^§^ (1.14) 

under a spacetime coordinate transformation x — > x' reduces for purely spatial coordinate transformations (ct, x) — s- 
(ct, x') to 

flx i fir* r>T J 

% (t,x')=T m (t,x), r j0 (t,x')=T i0 {t,x)^ } , T' kl {t,x')=T ij (t,x)^^ l . (1.15) 

That is the transformation behavior of a 3-scalar, a 3-vector, and a 3-tensor respectively. 

II This term should not connote that an energy component taking the form of an ideal fluid always features a pressure 

that could accomplish mechanical work (like moving a wall). For instance, a gas of weakly interacting relativistic 
particles such as neutrinos exhibits a pressure p = pc 2 /3 and yet could hardly move a wall. Therefore the term 
"pressure" here is rather a property of the system regarding its momentum distribution. 
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for a comoving fluid). Also note that the universe may consist of several ideal fluids = X^/P/W 
for / = 1, . . . , N, since the sum of an ideal fluid is also an ideal fluid. 
The Einstein field equations are 



G 



8ttG 



L fJ.U ) 



(1.18) 



where G^ u is the Einstein tensor for the Robertson- Walker metric (1.7) and T^ u is the energy mo- 
mentum tensor (1.17). Computation of the Einstein tensor is straightforward but tedious. App. 2.3 
of Durrer (2008) provides a table of all geometrical quantities of interest for the Robertson- Walker 
metric. Making use of them, we immediately find the Friedmann equations 12 





)'- 


8itG 


Kc 2 




3 9 


R 2 a 2 


H — H 2 = 


a 


4itG 




a 


3 



(1.19) 
(1.20) 



where a dot denotes the derivative with respect to cosmic time t. In these equations we also introduced 
the Hubble parameter H(t) defined by 

d(t) 



H(t) 



a(t) 



(1.21) 



The equation of motion for the ideal fluid (1.17) is given by the general relativistic energy- 
momentum conservation 

V U T^ = d v T^ + T% v T0 v + T^T^ = . (1.22) 
Again using the table in the App. 2.3 of Durrer (2008) we obtain 13 



3H[p + 3 



P 



1.23) 



Note that this equation is not independent from the Friedmann equations, but could be derived 
from them. However, if the energy momentum consists of many separate fluids T^ v = Xl/P^W f° r 
/ = 1, . . . , N, which are non- interacting (except for gravity), then the equation of motion holds for 
each fluid separately, i.e. 



pj = -3H (p 7 + 3g) , I=1,...,N. 



(1.24) 



This is information which could not be obtained from the Friedmann equations. We will always 
assume that the fluids considered are non-interacting. The total density and total pressure are 
obviously just the sums of the different components, i.e. p(t) = Yli Pi(i) an d p(t) = YliPi(t)i 
respectively. 

Throughout this introduction, we will denote every time dependent quantity evaluated at the time 
to for which a(to) = 1 by a subscript 0. For instance, Hq = H(to) is the Hubble constant and 
i?o = R(to) is the curvature radius of the universe at time to i n the case of K ^ 0. A model of the 
universe that is described in the framework of the Robertson- Walker metric and whose dynamics 
are determined by the Friedmann equations is called a Friedmann-Lemaitre- Robertson- Walker 
(FLRW) universe or FLRW (world) model. 14 



12 The first equation is the G° = 8tyG T° component and the second equation is the trace G l i = 8tyG T\. 
13 This equation is obtained by the V M T° M = component, while the V M T IM = components just yield dp(t)/dx l — 0, 
i.e. homogeneity. 

14 We include Georges Lemaitre in this acronym for his substantial contributions to the early development of these 
cosmological models. For a historical review we refer, for example, to Nussbaumer & Bieri (2009). 
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1.3.2. Equation of state 

In order to solve the Eqs. (1.19) and (1.24), we have to know how pi(t) and pi(t) are related for each 
separate fluid component. These relations are usually expressed by the equation of state 

«*<«> = ^ (!■*> 

for each component. If wj(t) is known for every fluid component and if the fluids are non-interacting, 
then we can solve the Friedmann equation for given initial conditions pjQ and for a given K . That 
is the Friedmann equation (1.19) together with the Eqs. (1.24) and (1.25) form a closed system of 
equations. Note that for given initial densities pio there is in general a (locally) expanding and a 
(locally) contracting solution due to the square on the left hand side of the Friedmann equation 
(1.19). 

If the fluid I with wi(t) is non-interacting with the other fluid components, pi(t) is determined by 
Eq. (1.24) and has the general solution 

Pl (t) = p I0 a (t)-3H-HW*)] , WeSl{t ) = -±- f Ha) ^ da . (1.26) 

ln(a) J a 

For a constant equation of state, i.e. wj(t) = wj, this reduces to 

Pi(t) = p 10 a(t)-^ 1+WI K (1.27) 



For a fluid of weakly interacting non-relativistic "particles" (e.g. DM, galaxies) holds w = 0, while 
for a fluid of radiation or relativistic particles holds w = 1/3. 15 A fluid with wj = —1 is special in 
the sense that it has constant energy density with time, i.e. pj{t) = pj , thus such a fluid can be 
interpreted as a property of space itself. Formally, it is equivalent to the inclusion of a cosmological 
constant term in the field equations: 

G^^G^ + g^K. (1.30) 

The cosmological constant A is then related to pi and pi by 

Ac 2 Ac 4 
Pio = ^ , W° = -W 2 = -^, (1-31) 



15 This can be easily seen in the special relativistic limit by representing the fluid as a set of N point particles (e.g. DM 
particles, galaxies, photons). The energy-momentum tensor of the fluid is then given by (Weinberg 1972, Sect. 2.10) 

where Xi(t) is the world line of the ith particle and = ([p»] ,Pj) its 4-momentum. Interpreting this expression 
as an ideal fluid (see Eq. (1.17)), it follows 

AT 2 N 

P=^ = ^§y) S3 ( x ~ x *)> pc 2 =T 00 = cJ2[P*?S 3 (x- Xt ). (1.29) 

With [pij^lpij^ = — [pt]°[pi] + Pi — —mc 2 it follows for such a fluid in general < p < pc 2 /3. Moreover, for 
non-relativistic particles, i.e. p 2 <JC mc 2 , it holds p <!C pc 2 /3, and for relativistic particles, i.e. p 2 ^> mc 2 , it holds 
p ~ pc 2 /3. 
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and the Friedmann equations correspondingly become 



8vrG 



Kc 2 
R 2 a 2 



+ 



Ac 2 



AttG 



+ 



Ac 2 



(1.32) 



where p and p do not include the Ith component anymore. 

For a flat universe, i.e. K = 0, that is governed by a single energy component T^ u with a constant 
equation of state w, the time evolution of the scale factor can be explicitly given. 16 Let be an 
arbitrary epoch. For w > — 1 and with Eq. (1.27), the function 




(1.33) 



is an expanding solution of the Friedmann equation (1.19), and yields the Hubble parameter (1.21) 
as 



H(t) 



a(t) 



1.34) 



a(t) 3{1 + w)t 

The origin of the time coordinate t = has been chosen such that the scale factor vanishes at that 
epoch, which in our simple model marks the beginning of the universe ("big bang"). Thus a flat, 
expanding universe with a single energy component (w > —1) has a beginning and will expand at 
any time, where it follows from Eq. (1.34) that its age is given by the inverse of the corresponding 
Hubble parameter (up to a constant of order unity). On the other hand, for w = — 1 the energy 
density is constant (see Eq. (1.27)) and so the Friedman equation (1.19) has the solution 



a(t) = a (U) e Ho{t ~ u) 



(1.35) 



with Hq = H(t*) = Y / 8vrGp(t*)/3 = const. In this case the scale factor never vanishes and the 
universe is formally infinitely old. While Ho is a free parameter for the solution (1.35), it is entirely 
specified for the solution (1.33) by Eq. (1.34), since we have fixed a(0) = 0. So the only remaining 
free parameter in the latter case is Ro, which however for a flat universe is just an arbitrary scaling 
with no observational consequences. Hence the evolution of a flat, expanding universe with a single 
energy component (w > —1) has effectively no degree of freedom. 

For a flat universe that is governed by several fluids Tj V with different, but still constant equations 
of state wi, the expansion history is slightly more complicated and can in general be only computed 
numerically. However, for a certain time interval ("era") between t{ and if, when the universe is 
dominated by the Ith energy component, i.e. 



T^(t) ~ [Tj]^{t) , ti<t<t t , 



(1.36) 



we can find approximate solutions. For wi > — 1 , the scale factor a(t) for an expanding universe is 
approximately described by 

/ t-t \ 2 IW 1+W '^ 
a(t) ~ a(t m ) - 2 , (1.37) 



t-t 

where t is a time shift that is determined by the Friedmann equation (1.19) for t = t m yielding 



1 



8vrG 



// 2 !/,, ; i ::>!/:„) 

3 (t m - t)(l + w)J •! 



(1.38) 



16 In Section 2.3.1 we discuss the solution of an overcritical, i.e. K = 1, universe for w — 0. 
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and i m is a fixed epoch between t\ and if. For wi = — 1 we have instead the approximation 

a(t) ~ a(i i )e H( * i)( ^* i) . (1.39) 

Thus a comparison of the approximations for the multi- component system to the solutions of the 
single-component systems (see Eqs. (1.33) and (1-35)) shows that the difference is just a time shift 
t. For a given era that is dominated by the component J, this shift is usually so small relative to 
the corresponding age of the universe t m that it can be neglected. Hence for many applications it is 
sufficient to just use the Eqs. (1.33) and (1.35) even in the case of a multi-component system. 

What can we say about the acceleration d(t) of the universe? The second Friedmann equation (1.20) 
tells us that, irrespective of the curvature, the expansion of the universe is decelerating, i.e. d(t) < 0, 
if the universe is dominated by an equation of state wi > —1/3, and accelerating, i.e. d(t) > 0, if it 
is dominated by an equation of state wi < —1/3. Thus a fluid with wi < —1/3 has the remarkable 
property to act repulsively by gravitation. This means it violates the strong energy condition which 
requires physical fluids to satisfy pj + 3pi > 0. 



1.3.3. Density parameters 



In order to study and compare different cosmological models, it is convenient to introduce the di- 
mensionless density parameters defined as 



Piit) 
Pc(t) ' 



Pc(t) = 



3H 2 (t) 
8ttG 



(1.40) 



where p c (t) is the critical density. The first Friedmann equation (1.19) then simplifies to 

N 



i = - 



Kc z 



i=i 



H 2 (t)R 2 a 2 (t) 



= n K (t) 



(1.41) 



where ^x(i) is the curvature density acting phenomenologically like a fluid with an equation of 
state wk = — 1/3 (cf. Eq. (1.27)). Note that such a fluid does not contribute to the second Friedmann 
equation (1.20) due to pic(t) + 3px(i)/c 2 = 0. Moreover, it follows from the definition that 



n K (t) < o k = i , n K (t) = k = o , n K (t) > o k = -1 , 



(1.42) 



and tlK(t) cannot change the sign during the evolution of the universe. The first Friedmann equation 
then takes by construction the very compact form 



N 



i=i 



(1.43) 



The cosmological models are usually characterized by the present day values of the density param- 
eters. For ease of notation we will omit the subscript for the values of the density parameters at 
to, i-e. if no particular epoch is indicated, it holds Qj = fij(io) and VLk = Oft-(io)- Assuming that 
the different fluid components / are not interacting with each other, the evolution of each density 
parameter can be expressed using Eq. (1.27) as 



-3(l+w/) 



Hp 
H(t) 



n K (t) = n K a - 2 (t) 



Hp 
H(t) 



(1.44) 
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so that the Friedmann equation (1.43) becomes 



H(t) = H 



N 



\ i=i 



(1.45) 



1.4. Observational cosmology 

To connect the theoretical framework developed in the previous two sections with astronomical 
observables, we have to understand how photons behave in this framework. This directly leads to 
the redshift of galaxies, which is probably the most important observable of extragalactic astronomy. 



1.4.1. Cosmological redshift 

Consider a photon being emitted by a distant galaxy at the spatial coordinate X\ that arrives at 
the Earth being at xq at the present epoch. Without loss of generality, we can assume that the 
Milky Way lies at the origin of the spatial comoving coordinate system, i.e. xq = 0, and that the 
distant galaxy has the comoving coordinate x\ = (r, 0, 0). Like in special relativity, the world line of 
a photon in general relativity is characterized by ds = due to the principle of equivalence, so with 
the Robertson- Walker metric (1.7) it holds for a photon coming toward us 

dr = %dt. (1.46) 

Now consider two wave crests of the photon leaving the distant galaxy at t and t + dt, respectively, 
and arriving the Earth at to and to + Sto, respectively. Since the two galaxies are at fixed comoving 
coordinates x\ and xq, the comoving distance traveled by the crests of the photon is the same for 
the two crests, i.e. 

rto „ rto+Sto 



/•to r rto+oto „ 

- / -±-dt = - / -^dt . (1.47) 
Jt Jt+st <*) 

This leads to 

f to+Sto c , f t+5t c , cSt cSt 

= / —-dt- — dt ~ —4 — , (1.48) 

J to a(t) J t a(t) a t a(t) y 1 



where the last approximation is very accurate since dt ~ dto ~ 10" 14 s for visible light. Since St and 
Sto are just the periods of the wave of the photon at the epochs t and to respectively, it holds for the 
frequencies of the emitted photon, i.e. v em = 1/St, and the observed photon, i.e. v ^ s = 1/Sto, 



(1.49) 



This means that a photon experiences a frequency shift inversely proportional to the expansion of 
the universe during its journey. 17 The new introduced quantity z is called cosmological redshift, 
if the frequency shift is towards smaller frequencies, and cosmological blueshift, if the frequency 
shift is towards larger frequencies. Since essentially all galaxies exhibit a redshift, we will call z just 




17 Our derivation was slightly heuristic. For a more formal derivation by means of the collisionless relativistic Boltzmann 
equation see e.g. Durrer (2008, Sect. 1.3.3). 
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(cosmological) redshift. Thus assuming that a galaxy is a comoving observer and that there is no 
other contribution to its redshift, the ratio a(to)/a(t) between the emission and arrival of the photon 
can be measured by studying the spectrum of the galaxy. Unfortunately, as we will see in the next 
section, for instance the fact that galaxies are not perfect comoving observers produces an additional 
redshift contribution, which cannot be disentangled from the cosmological one. But if not mentioned 
otherwise, we assume that the redshift is purely cosmological. 

Eq. (1.47) shows that for a photon arriving at the Earth at the present epoch there exists a 
one-to-one correspondence between the comoving distance d = r (see Eq. (1.13)) and the emission 
epoch t of the photon. Then there is a one-to-one correspondence between the scale factor a(t) 
and the emission epoch t, if a(t) is a monotonically increasing function, and there is a one-to-one 
correspondence between the scale factor a and the redshift z given by Eq. (1.49). So finally, if the 
universe is monotonically increasing, there exist one-to-one correspondences between any pair of the 
four quantities d, t, a, and z, and we can always express any of these quantities as a function of 
any other. To derive the relation d(z) explicitly, we again consider the world line of a photon (see 
Eq. (1.46)) 

c c c 

ait) dr = —cdt = — da = — - - , — dz = — - a(t) dz , (1.50) 

1 ' a(t) a(t)H(t) dz H(t) w v ; 

where t is the emission epoch for a photon and where we have used da(z)/dz = — l/(l + 2;) 2 = — a 2 (t). 
So it holds 

r ~/v 



dr = -rjr^dz , H{z) = H { 



Hjz) ' n \ z ) = n ^^2^ 



Y,^i(^ + z) 3{1+Wi) + ^k(1 + z) 2 , (1.51) 



where the explicit expression for H(z) = H(t(z)) is immediately obtained by using Eq. (1.45) and 
the definition of the redshift. The comoving distance to a galaxy with redshift z is then 



i, I i== 1 = dz . (1.52) 

Jo * '££=1 fii (i + z)^+^) + n K (i + zf 



Surveys encompassing thousands or even millions of galaxies have shown that essentially all galaxies 
exhibit a redshift meaning z > 0. In the light of Eq. (1.49), this is direct confirmation that a(t) was 
indeed smaller when the observed galaxy photons were emitted. Today, it is well established that 
there exists a one-to-one correspondence between the distance and the redshift of a galaxy. On the 
one hand, this is directly demonstrated with the aid of supernovae la acting as "standard candles" 
up to redshifts of z ~ 1, and, on the other hand, it is a consequence of the concordance cosmology 
to be introduced in the next section. This justifies a posteriori our assumption of a(t) being a 
monotonically increasing function of time. 



1.4.2. Peculiar velocities 

As mentioned in Section 1.1, our universe is only homogeneous for length scales > 100 Mpc. On 
smaller scales, however, it is strongly inhomogeneous, which leads to deviations from the overall 
Hubble flow of the order of a few 100 km/s. These deviations are termed peculiar velocities. The 
radial components of the peculiar velocities of galaxies add a contribution to their total redshift 
by means of the Doppler effect at the position of the galaxies which observationally cannot be 
disentangled from their cosmological contribution defined by Eq. (1.49). 

Consider a galaxy which resides at the position corresponding to the cosmological redshift z cos and 
which has a peculiar velocity 5v in radial direction. Due to the local Doppler effect a photon emitted 
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from that galaxy in our direction as observed by a comoving observer at the position of the galaxiy 
is redshifted by 

l + * = ^ = VT^ a!l + T (L53) 

for |<5v/c| <C l, where v is the frequency of the photon in the restframe of the galaxy and v' the 
frequency observed by the comoving observer at the position of the galaxy. 18 After its emission the 
redshifted or blueshifted photon travels all the way to the Earth and is further redshifted due to the 
expansion of the universe, i.e. v' jv" = 1 + z cos , where v" is the frequency of the photon arriving at 
the Earth. Thus the total, observable redshift z of the galaxy is 

1 + Z= ^ = ^ = (1 + -Zees) (1 + Zp) ^ (1 + ^cos) U + — ] , (1-54) 
v" v' v" \ C J 

leading to the redshift perturbation 

5v 

Sz = Z - Z cos — (1 + Zcos) — • (1.55) 

c 

If the observed redshift is interpreted as purely cosmological, this redshift perturbation produces a 
spurious displacement 51 of the galaxy along the line of sight of the order 

dd c 1 ~\~ z 

51 = d(z) - d(z cos ) ~ ^-(zcos) 5z = — r Sz ~ — cos 5v . (1.56) 

Therefore, in observational cosmology one has to distinguish between the ideal real space, where 
the true distances of galaxies are known, and the observable redshift space, in which distances are 
inferred from their observed redshift. 

Peculiar velocities are particularly prominent in groups (and clusters) of galaxies. A galaxy group 
is a gravitationally bound system (typically associated with a DM halo, see Sect. 2.3) containing 
several galaxies and other forms of matter, which are moving in the gravitational potential of the 
group. Due to the gravitational boundedness, the system is decoupled from the Hubble flow and hence 
photons moving through the group are not affected by the expansion of the universe until they leave 
the group. Thus the observable redshifts of galaxies within groups consist of the redshift z gT of the 
group as a whole 19 and their line of sight peculiar velocities within the group. As a first consequence, 
the redshift of the galaxies does not contain any information about the line of sight position of the 
galaxy within the group. As a second consequence, the peculiar velocities of galaxies in groups lead 
to an elongated shape of groups along the line of sight in redshift space, as only the components 
of the peculiar velocities parallel to the line of sight contribute to the redshift perturbations. Since 
these elongations are always pointing toward us, they are termed fingers-of-god. If a group with 
redshift z gI has a line of sight velocity dispersion ct v , i.e. the standard deviation of the line of sight 
components of the peculiar velocities of its galaxies, its finger-of-god has the comoving radial length 
(see Eq. (1.56)) 




(1.57) 



18 Note that only the radial components of the peculiar velocities lead to redshift perturbations, as the transverse 
Doppler effect caused by the velocity components perpendicular to the line of sight can be neglected due to the 
non-relativistic motions of galaxies. 

19 This redshift can be purely cosmological or it can itself feature a peculiar velocity, if the whole group is moving with 
respect to the Hubble flow. 
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where cr z denotes the redshift dispersion of the galaxies. This is a convenient formula for measuring 
the velocity dispersions in groups by means of their redshift distribution. This example illustrates 
that peculiar velocities of galaxies can be both a blessing and a curse; they obscure the real positions 
of galaxies, but provide us valuable information about the dynamics of galaxies. 



1.4.3. Horizons 

The distance that photons can travel in the universe during a given time interval defines the radius 
of causality, within which information can propagate during the time interval. This radius is called 
horizon. In cosmology, there are two kinds of horizons of interest. 

The particle horizon at time t is the distance that a photon can travel from the beginning of 
the universe up to this time. This means that a point in space is causally connected only to the 
region within its particle horizon. This region with us being a the center is called the observable 
universe. With Eq. (1.46), the comoving particle horizon d p (t) at time t is given by 



(1.58) 



where U is the beginning of the universe. If the universe is flat with a single ideal fluid component, 
we can set t\ = and use Eq. (1.33). It follows immediately that for an equation of state w > —1/3 
the particle horizon is finite and takes the value 

t c c 1 

dp(t) = W) ^3(t£) = ^M*) §(! + ") -1 ' (L59) 

where in the last step we have used Eq. (1.34). On the other hand, for w < —1/3 there is no particle 
horizon (i.e. it is infinite) even though the age of the universe is finite. Note that formally the particle 
horizon corresponds to redshift z = oo (see Eq. (1.49)). 

The event horizon d c is the distance that a photon can travel from now until the end of the 
universe if. This means that a photon emitted at the present epoch from a galaxy outside our event 
horizon will never reach us even if we wait infinitely long. Formally the event horizon d e (t) is 



de(t) = f 



a(*; 



dt' . 



(1.60) 



By means of a similar argument like for the particle horizon, we find that the event horizon for a flat 
universe with a single component with an equation of state — 1 < w < — 1/3 is finite and given by 

de(t) = W) 3^-1 = wm l-ia+u,) ' (m) 

where for such models the end of the universe is tf = oo. (For the case w = — 1 we have used 
Eq. (1.35).) For w > —1/3 there is no event horizon, i.e. any photon emitted at the present epoch 
will reach us at some time in the future. 

Note that both the comoving particle horizon as well as the comoving event horizon are essentially 
dn(t) = c/(Ha) times a numerical constant of order unity. This shows that dn is the typical length 
scale in a FLRW universe at time t and we call du the comoving Hubble length. We will often 
approximate either horizon by this quantity. The proper Hubble length is then just d^(t) = d^a = 
c/H. Note that for w = — 1 the proper Hubble length is constant, since in this case H{t) = Hq is 
constant, and identical to the proper event horizon. 
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Table 1.1. Present day cosmological parameters as obtained by the WMAP 7- year data set 
combined with supernovae la data, and acoustic baryonic oscillations (Komatsu et al. 2011). 



Parameter Present day value Equation of state 



Hubble constant 


h = 0.702 ±0.014 






baryonic matter 


Q h = 0.0458 ± 0.0016 


w h = 





dark matter 


n d = 0.229 ±0.015 


w d = 





dark energy 


n A = 0.725 ±0.016 


WA = 


-0.980 ±0.053 


curvature density 


-0.0133 <n K < 0.0084 


W K = 


= -1/3 


photons 


n y = 2.47 x 10~ 5 h~ 2 


Wry = 


1/3 


neutrinos 


VL V = 1.71 x 10~ 5 h- 2 


w u = 


1/3 


spectral index 


n s = 0.968 ± 0.012 






linear fluctuation amplitude 


o-g = 0.816 ±0.024 







Note: The values and errorbars of the parameters h, fib, fid, ^A, "-s, and as corre- 
spond to the mean and 68% confidence limits (CL), respectively, of the marginalized 
distributions after fitting the data to a flat 6-parameter ACDM model. To estimate 
wa the cosmology was kept flat and only WMAP and supernovae data were used, 
and to estmate fix an equation of state wa = — 1 was assumed. The errorbar of wa 
corresponds to the 68% CL and the one of fi^- to the 95% CL. The estimation of fi 7 
and fij, is described in Sect. 1.5.1. 



1.5. The concordance model 

In the last three sections we have developed the FLRW framework for a general expanding universe. 
Fortunately, the growing amount of observational data in astronomy, particularly the CMB and 
huge galaxy surveys, have allowed the determination of the constituents of the universe Tj" and the 
present day values of the cosmological parameters (e.g. Ho, fij, fix) to the impressive precision of a 
few percent. This led to the currently favored concordance model, which is a flat universe whose 
energy budget at the present epoch is dominated by some sort of exotic "cold dark matter" (CDM) 
and exotic "dark energy" in the form of a cosmological constant A, where exotic refers to the fact 
that these constituents must represent physics beyond the standard model of particle physics and 
could not yet be observed in human made experiments. Due to these two main contributions, the 
concordance model ist also called ACDM model. 

A summary of the current values of the present day cosmological parameters as obtained by a 
combination of the CMB WMAP 7-year and complementary data sets (Komatsu et al. 2011) is given 
in Table 1.1, where the Hubble constant is parametrized by means of h as H = 100 /i kms -1 Mpc" 1 . 
The density parameters fib, fi m , Qa, Q-y, and £l v will be discussed in detail in the following section, 
and the cosmological parameters n s and as, which describe the clustering in the universe, will be 
introduced in the Sections 2.2.2 and 2.2.3 respectively. 

One of the most striking properties of our universe is that its geometry is essentially flat, i.e. (see 
Tab. 1.1) 

Q K ~0. (1.62) 



On the one hand, within a flat universe the formalism developed so far (and the one to be developed 
in the other chapters) simplifies a lot. From now on we will stick to the case of a precisely flat 
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universe. On the other hand, without a reason at hand why the universe should be flat this result is 
somewhat surprising and became what is known as the "flatness problem" , which will be discussed 
in Section 1.6.1. 



1.5.1. The content of the present day universe 

We see from Table 1.1 that the energy content of the present day universe mainly consists of 



n A ~ 0.73 , fi d ~ 0.23 , fi b ~ 0.04 , 



(1.63) 



where Qa corresponds to dark energy, to cold dark matter (DM), and fib to baryons 20 . Thus 
dark energy is the dominant energy contribution of the present day universe and acts repulsively due 
to its equation of state being smaller than —1/3, i.e. dark energy is responsible for the observed 
acceleration of the universe, which was directly observed for the first time by means of supernovae 
la (see Weinberg et al. 2012 for a review about observational probes of the cosmic acceleration). 
Moreover, all measurement are so far consistent with w\ = — 1 (this is the reason why it has been 
given the subscript of the cosmological constant A) , and yet there is no clue from fundamental physics 
what dark energy could be. Hence dark energy is not only the dominant, but also the most mysterious 
component of the universe. DM being the second ranked dominant energy contribution at the present 
epoch must constitute some sort of non-baryonic, cold (i.e. non-relativistic), very weakly interacting 
massive particle (WIMP), which has not been detected in laboratories yet. Its non-relativistic nature 
can be inferred from the observed cosmic structures at small scales, and its non-baryonic nature is 
a consequence of the theory of nucleosynthesis in the early universe (see the next section for a brief 
history of the universe). Since baryons and DM have the same equation of state, it is meaningful for 
many applications to add them together yielding the total matter density 

n m = n d + n h ~ 0.27 . (i.64) 

The universe also contains energy in the form of relativistic particles such as photons and neutrinos. 
By far the biggest contribution to the energy density of such particles comes from the equilibrium 
distribution produced in the early universe. The energy density for a relativistic particle species of a 
given temperature T is 

vr 2 . „ (kT " 



"< T ' = 9 3o teT (&J • (L65) 

where g denotes the effective number of degrees of freedom of the particle. 21 With g 7 = 2 for photons 
and the current CMB temperature of T 7 = 2.725 ± 0.002 K (95% confidence, Mather et al. 1999) we 
obtain the photon density 

fi 7 = U ^ )/c2 ~ 2.47 x 10~ 5 h~ 2 . (1.66) 

Pc 

Since for the neutrinos it holds g v = 2 x 7/8 and since the temperature of the neutrino background is 
predicted to be T v = (4/ll) 1 / 3 T 7 = 1.95 K by the theory of the early universe, the neutrino density 



Unlike in particle physics, in cosmology "baryons" also comprise electrons, i.e. the term just refers to "normal matter" 
of which gas, stars and planets etc. are made in contrast to more exotic matter like neutrinos and DM. 

For a derivation of this formula we refer, for instance, to Mukhanov (2005, Sect. 3.3). The effective number of degrees 
of freedom for a photon is g-, — 2 due to the two polarization states and g v — 2 x 7/8 for each neutrino species. In 
the latter case the factor 2 is due to existence of neutrinos and anti-neutrinos and the factor 7/8 must be included 
for all fcrmionic particles. 
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is 



n) 



4/3 



iV„ ^ ^ 1.71 x 1(T 5 /T 2 



(1.67) 



with TVj/ = 3.04 the standard value for the effective number of neutrino species (Komatsu et al. 2011). 
Note that in contrast to the CMB and its temperature, the expected cosmic neutrino background 
could not yet be detected, but is predicted by the cosmological model of the early universe, which 
will not be discussed in this introduction. The sum of the photon and neutrino densities yields a 
total radiation density of 



5 u-2 



ttj. = 7 + n v ~ 4.2 x 10~ & h 



(1.68) 



Comparing this value to those in Eq. (1.63) shows that the radiation density is negligible in the 
present universe, and so the relation (1.52) between comoving distance d and redshift z simplifies for 
the redshift range accessible by optical astronomy, i.e. z < 10, to 



d(z) = f 

J 



H(z) 



dz 



h(z) = H oy /n A + n m (i + z)z . 



(1.69) 



1.5.2. History of the universe 

With the cosmological parameters specified in the previous section we are able to give an outline 
of the history of the universe. Details can be found in almost any textbook of cosmology. We will 
follow the summary in Mukhanov (2005, Sect. 3.2). 

At the present time the universe is mainly driven by Q\, while radiation Q r is entirely negligi- 
ble. However, due to the different equation of states the relative importance of the different energy 
constituents change with time according to Eq. (1.27). The larger wi the more important is the cor- 
responding energy constituent at earlier times. So there was a time when the universe was radiation 
dominated, then there was a time when it was matter dominated, and now it is about to become A 
dominated. 

The model of the history of the universe is obtained by extrapolating the current state back in 
time and feeding it with the inputs from observations. By doing this we find a remarkably consistent 
model back to the time when the universe was about 10 -5 seconds old. The basic idea is that the 
universe becomes smaller and smaller as we go back in time and so the matter density higher and 
higher. At an early enough point in time the universe was so dense that matter and radiation were 
in the state of a plasma and the different energy contributions cannot be treated as non-interacting 
anymore (e.g. Eq. (1.24)). The further we go back in time the higher the temperature and the 
more particle species are being created and interact with the plasma. At about 10~ 5 seconds or 
equivalently at a temperature of T = 200 MeV/fce the quark-gluon transition takes place, which is 
not fully understood yet. Going much further back in time leads to the problem that we cannot 
probe the physics anymore in our accelerators, since the involved particle energies become too large. 
In the following we will outline the main stages in the history of the universe: 

Very early universe (<10~ 14 s) For Energies > 10 TeV we have no clue about the physical in- 
teractions from accelerator experiments and so any model for this stage of the universe will 
necessarily be very speculative. This is the era where hypothetic processes, such as the origin 
of baryon asymmetry or inflation, might have taken place. 
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Early universe (~ 10~ 5 — Is) After the energy dropped to 200 MeV, the quark-gluon transition 
takes places: free quarks and gluons become confined within baryons and mesons. This is the 
starting point from when we understand the history of the universe in great detail. In this era 
the universe is a hot plasma where many particle species (e.g. electrons, neutrinos, photons, 
baryons) are in thermal equilibrium with each other. As soon as the inverse of the interaction 
rate for a given particle species with all the other species exceeds the characteristic timescale 
1/H(t) of the universe, the corresponding species "freezes out" and remains as a thermal relict 
from the early universe. As the energy reaches ~ 0.5 MeV only electrons, photons, protons and 
neutrons remain in the plasma. All other particles froze out. 

Nucleosynthesis (~3 — 5 min) At energies of ~ 0.05 MeV nuclear reactions become efficient, so 
that free protons and neutrons form helium and other light elements. 

Matter-Radiation-Equality (t eq ~ 60,000 yr) This is the epoch when the energy density of matter 
and radiation was equal, i.e. Q, r (t cq ) = ^m(^eq)- Before this epoch the universe was radiation 
dominated and afterwards matter dominated. 

Recombination (idee ~ 380, 000 yr) Electrons and positrons recombine to form neutral atoms. The 
universe becomes transparent and the cosmic microwave background (CMB) is released as a 
cosmic relict. This process is also called decoupling and corresponds to a redshift z^cc — 1089 
with a thickness of Az ~ 200 (Bennett et al. 2003). 

Structure formation (~0.1 — 13.7 Gyr) As time goes on, tiny fluctuations in the distribution of 
matter start to grow under the action of gravity leading to the LSS at the present day to ~ 13-7 
Gyr. This is the topic which will be discussed in detail in the next three chapters. 

These different stages are, of course, not totally separated from each other, but blend and interact 
leading to a complicated history of the universe, which can in detail only be modeled by numerical 
simulations. As we will see in the next chapter, DM fluctuations can efficiently start to grow as 
soon as the universe becomes matter dominated, while the baryonic fluctuations are prevented from 
growing due to the interaction with the photons (and remain at the temperature of the photons even 
for some time after recombination). 

What is the age of our universe? To answer this question we first need an event that we can 
interprete as the beginning of our universe and then we need the full knowledge about the expansion 
history of the universe since that beginning. But as discussed before, we have no firm knowledge about 
the physics in the very early universe and so any model of the universe at that time is unavoidably 
very speculative. Since we can only count back as long as we understand the universe, it is reasonable 
to define the beginning of the universe as the epoch, when the scale factor formally becomes zero 
at very early time during the radiation dominated era. This is the time coordinate that we used in 
the outline of the history of the universe above and according to this definition of the beginning, the 
universe basically starts with inflation, which is a meaningful starting point as will be discussed in 
the next section. However, the questions whether inflation really took place and, if yes, what was 
before inflation, cannot be answered today, and it is open whether we will ever be able to answer 
them. 

With the definition of the origin of the time coordinate t at hand, the present age of the universe 
can be computed as follows. With Eq. (1.50) we have 



dt = 



1 



da 



1 



da 



(1.70) 



d(t) 



a(t)H(t) 
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and so the age of the universe is 

Mt )=i 1 i r 1 1 i 

to= ~^da = — da ~ 13.7 Gyr , (1.71) 

Jo aH H J a ^/Q A + Q mQ -3 + ^ a -4 

where we used Eq. (1.45) and the values in Table 1.1. Note that for this precision it does not 
matter whether or not we consider the contribution from radiation, since f2 r is very small and can 
change the age only about 6 Myr. In fact, the age of the universe at the epoch of matter-radiation- 
equality i cq ~ 56, 000 years is so small compared to the typical age of the universe during the matter 
dominated era that we can safely neglect the radiation and approximate the scale factor during 
matter domination by means of Eq. (1.21) instead of Eq. (1.37), i.e. 

.(.)a^, *M-|}-£. d.«) 

Similarly, to describe the evolution of the scale factor during the radiation dominated era, we can 
neglect the first second, when complicated processes might have taken place, and just write 

.<.)«<■/», Bl t)-m 1.. (,.„, 



1.6. Inflation 

In the previous section it became clear that from energies of about 200 MeV to the present time, 
we have a mostly consistent story of the universe, and the physics of the universe is well understood 
and tested in our laboratories (except DM and dark energy, but we nevertheless understand their 
phenomenological behaviors quite well today). However, as we go further back in time, the story 
of the universe becomes much more fuzzy until we do not know anything safe about the involved 
fundamental physics. Nonetheless, it was possible to propose a consistent scenario for the universe at 
very early times called inflation, which has the potential to produce the known radiation dominated 
early universe from a preexisting chaotic state and to solve a couple of independent shortcomings of 
the concordance model. The basic idea of inflation is that the very early universe underwent a short 
stage of accelerated expansion (i.e. d(t) > 0) driven by a scalar field <fi. 

In Section 1.6.1, we briefly discuss the shortcomings of the concordance model and how inflation 
is able to solve them. Then in Section 1.6.2, we outline the phenomenology of the simplest class 
of inflation models ("slow-roll" inflation). The rather technical formalism of the theory of scalar 
fields is provided in Appendix A to focus on conceptual issues here. The most important aspect of 
inflation in the context of this introduction is that it can produce density perturbations which act 
as the starting point for structure formation in the early universe. The main idea of this process is 
summarized in Section 1.6.3, whereas the technical details are presented in Chapter 4. We finally 
conclude this section with some remarks on the plausibility of inflation and on the light inflation 
shed on the cosmological principle. 



1.6.1. The connection to the concordance model 

There are certain features associated with the concordance model that seem weird. The most famous 
of these features are the "flatness problem" and the "horizon problem" . 
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Flatness problem The curvature density Q,k denned in Eq. (1.41) can be expressed in terms of 
the comoving Hubble length dn(t) (see Sect. 1.4.3) 



where Rq is the curvature radius at the present epoch and thus equal to the comoving curvature 
radius of the universe. That is fifc(i) basically measures the ratio between the Hubble length being 
roughly the radius of the observable universe and the radius of curvature of the universe at time t. 
The smaller |Qk(*)| the less curved space appears within our particle horizon. The value of f^K at 
the present epoch is consistent with a flat universe to a high precision (see Table 1.1). However, 
since our universe was dominated by either radiation or matter during most time of its history, it 
was essentially decelerating and so dn(t) = c/{Ha) = c/a was always increasing with time. This 
means that |Ox(t)| was even much closer to unity in the past. So the question naturally arises why 
the universe was so close to flat in the past or why it is still so flat at present. 

Horizon problem At the epoch of decoupling Zdec ~ 10 4 , the comoving horizon du(t) was so small 
that observed today perpendicular to the line of sight at corresponding distance it would subtend 
only about 1 degree on the sky. Yet the CMB exhibits the same temperature in any direction to a 
precision of 10~ 5 . How is it possible that two causally absolutely disconnected parts in the universe 
can exhibit the same temperature to such a high degree although they never were in causal contact 
and thus never in thermodynamic equilibrium? 

Both problems do not constitute inconsistencies in the framework of the concordance model, but 
the concordance model does not give any clue why the initial conditions should be as described in 
either problem. They give the impression of some sort of fine tuning related to the initial conditions 
of the universe. Yet there are a couple of further similar questions: 

• Why are there no "topological defects" (e.g. magnetic monopoles) in the universe, although 
they are expected by extensions to the standard model of particle physics to be created in the 
very early universe? 

• Why is the universe expanding at all, although it was decelerating during most of its history? 

• How were the density seeds generated in the early universe (along with their characteristic 
power spectrum), which led to the LSS observed in the universe? 

It would be tempting to solve all these problems by a single additional ingredient of the concordance 
model and yet this is exactly what inflation aims to achieve. Inflation being a stage of accelerated 
expansion in the early universe is characterized by the condition d(t) > 0. With 




(1.74) 




(1.75) 



and Eq. (1.4.3) follows the relation 



o(t) > d H (t) < 



d_ 
di 



n K {t)\ <o. 



(1.76) 
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Figure 1.3. Schematic sketch for the evolution of the comoving Hubble length dn during and 
after inlation. Since g?h is decreasing during inflation, a comoving scale I can exit the horizon 
and reenter it after inflation finished and c?h started to increase again. 

From the last expression it is clear that inflation can solve the flatness problem by decreasing |fj#(i)| 
to an arbitrary small value. Since the comoving Hubble length (in is decreasing during inflation, 
also the horizon problem can be solved. To see this, suppose the comoving particle horizon at the 
beginning of inflation was d p (t{) and consider a comoving scale I within this horizon (see Figure 
1.3). Now as the expansion of the universe starts accelerating there starts to exist an event horizon 
d e (t) — dn(t) in the universe. 22 If I < dn(t{) at the beginning of inflation, there will be a time t out 
when the comoving scale I crosses the horizon, i.e. / = dn(t ou t). This means that a scale being in 
causal contact at the beginning of inflation will not be in causal contact anymore at t out , i.e. any 
signal emitted at one end of a scale I at time t out will not be able to reach the other end as long 
as inflation is going on. What happens after the end if of inflation? The expansion of the universe 
starts decelerating again and thus the event horizon vanishes. The region defined by (cf. Eq. (1.58)) 



is the region of causal contact for events that happen after inflation and thus defines some kind of 
"apparent particle horizon" for such events. For instance, a photon that is emitted at some time 
after inflation cannot reach us from distances larger than dn(t) at time t. Since dn(t) is an increasing 
function of t after inflation, each scale I that exited the event horizon during inflation reenters the 
horizon at time a t- in when I = du(ti n ). That is regions that lost causal contact during inflation start 
interacting again. This is why it is possible for the CMB to have essentially the same temperature in 
all directions, although the ^(tdec) at time of decoupling is much smaller than the scales of the CMB 

22 We assume here that we can approximate tf in Eq. (1.60) by oo and that with Eq. (1.61) follows d e (t) ~ dii(t). 
The longer inflation lasts, the better is this approximation. Note that the event horizon is independent from what 
happened before inflation. 




(1.77) 
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observed today. The reason is that the actual particle horizon d p (td ec ) is much larger than dn^dec); 
i.e. at some time in the past there already were interactions on scales much larger than dn(t<icc)- 

In a similar manner, inflation can also solve the other raised questions. It is not yet clear what 
initiated the inflationary era. Theorists argue that inflation might happen under very general con- 
ditions and that our universe might be only a homogeneous and isotropic "patch" within a huge 
chaotic and inhomogeneous universe ("chaotic inflation", Mukhanov 2005, Sect. 5.6). If this was the 
case, inflation would not only explain why our observable universe is remarkably flat, it would also 
explain why we live in a FLRW universe at all (Weinberg 2008). So, for the sake of simplicity, we 
will assume that for the description of inflation we can start with a flat FLRW universe (if it was 
not, inflation would make it so) and try a proper assessment of the whole inflationary paradigm in 
Section 1.6.4. 



1.6.2. Slow-roll inflation 

As discussed in Section 1.3.2, the universe is only accelerating if it is dominated by one or several 
energy components with equations of state wj < —1/3. So this condition must be satisfied during 
inflation. The simplest way to achieve this is by means of a real scalar field 4>(x). The associated 
theory is rather technical and is presented in Appendix A. So far, there has been no detection of a 
scalar field in particle physics, but the Higgs field being part of the standard model of particle physics 
would be a field of this kind and will perhaps be detected by the LHC in CERN. 

In the FLRW framework, the scalar field 4>(t) is homogeneous and thus only a function of time. 
As is shown in the Appendix A. 2, a scalar field <p moving in a potential V((f>) behaves like an ideal 
fluid with an effective matter density and pressure 

P<t> = \fi + V(<f>) , P4> = \<P 2 ~ V(<f>) , (1.78) 

respectively, and obeys the equation of motion 

4> + 3Hj> + V'(<l>) = 0, (1.79) 

where V'(<p) = dV/dcj). The term (j) 2 /2 is like the "kinetic energy" of the field, and the term 3H(fi in 
the equation of motion comes from the expansion of the universe and acts like a friction. With the 
expressions in Eq. (1.78) the equation of state of the scalar field is 

with the bounds —1 < w^it) < 1. This equation of state is generally time dependent. However, if 
(j? < 4V(<£) then < —1/3 and the condition for inflation is satisfied. Moreover, if 4> 2 <C V{4>), 
then the equation of state even becomes w ~ — 1 and is constant. The associated expansion of the 
universe is then exponential (see Eq. (1.33)) with H and <p being roughly constant. 

So far, the form of the potential V((j)) is undetermined and there are many possible inflationary 
scenarios proposed in the literature (see e.g. Liddle & Lyth 2000). However, the predictions from the 
simplest models of inflation are rather robust and so it is not necessary to specify all the details as 
long as certain general features are satisfied. The simplest class of inflationary models is called slow- 
roll inflation. These models contain a single scalar field <p called the inflaton and are characterized 
by the conditions 

</» 2 <y(0), |0| < \$H4\ ~ \v'(<j))\ . (i.8i) 
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That is the "kinetic energy" 4> 2 /2 of the field is small compared to the potential V(4>) and thus the 
(j) is rolling slowly down the potential V((f>). In this approximation, the equation of state is in fact 
w ~ —1 and the expansion of the universe is roughly exponential. The Friedmann equation (1.19) 
for K = and the equation of motion (1.79) simplify to 



H* = ^V{<i>), 3H4 + V'(<f>) = 0, (1.82) 
respectively. Slow-roll inflation is usually quantified by the slow-roll parameters 



1 fV'\ 2 H 3</» 2 3., , 1 V" 4> oox 

e ^16^{v) = ~Hi = 2V = 2 il + W * ) > ^^G^ = e ~iry (L83) 



where the different expressions are obtained by using the Eqs. (1.78) and (1.82). Comparing the 
Eqs. (1.81) and (1.83) shows that the slow-roll conditions are equivalent to 



e < 1 , n < 1 . 



(1.84) 



Since e <C 1 and rj <C 1 are equivalent to \V'/V\ <C 1 and \V" /V\ <C 1, respectively, the slow roll 
approximation is automatically satisfied if V{4>) is sufficiently flat. 



1.6.3. Generation of the primordial perturbations 

Probably the most important aspect of inflation in the context of structure formation is the possible 
associated production of tiny perturbations in the early universe that act as density seeds at the be- 
ginning of structure formation. The mechanism employs quantum mechanical processes. Mukhanov 
(2005, Sect. 8.2.3) gives an intuitively very clear description of how this works: 

Actually, inflation smooths classical inhomogeneities by stretching them to very large scales. How- 
ever, it cannot remove quantum fluctuations because in place of the stretched quantum fluctuations, 
new ones are generated by means of the Heisenberg uncertainty relation. But why is inflation needed 
for this process? The reason is that in Minkowski space, typical amplitudes of vacuum metric fluc- 
tuations are very small. They are only large near the Planckian scale. On galactic scales they are 
smaller than 10~ 58 and thus could never produce the perturbations of 10 -5 as measured in the CMB. 
The only way of producing such fluctuations on large scales is by stretching the very short wavelength 
fluctuations without decreasing their amplitude. As long as the fluctuations are within the horizon, 
they in fact continuously decrease when they are stretched, but as soon as they cross the horizon, 
the quantum mechanical fluctuations become classical and are indeed "frozen". That is they are 
stretched to galactic scales with almost no change of amplitudes. So inflation is necessary for the 
generation of perturbations, since only during an inflationary era the comoving Hubble length d^it) 
decreases such that comoving scales can exit the horizon. That perturbations are frozen outside the 
horizon is also the only reason why we are able to use inflationary theories to make any predictions at 
all about observational perturbations (Weinberg 2008). Remember that we basically know nothing 
about fundamental physics at the time when inflation happens and nobody knows exactly how the 
inflationary era turned into the radiation dominated universe of the concordance model. So all these 
unknown processes happened when the perturbations that are observable today were well outside 
the horizon and thus unaffected by any unknown physics. 

Also the statistical properties of the spatial perturbations created by inflation coincide very well 
with observations. A measure for the distribution of perturbations is the power spectrum P(k) (see 
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Sect. 2.2.1). Slow-roll inflation predicts a functional form of P{k) as (see Sect. 4.3) 

P(k)ock n % n s = 1-66 + 27?, (1.85) 

where n s is the spectral index and e and r) are the slow-roll parameters (see Eq. (1.83)). So we expect 
n s ~ 1 and maybe slightly below. The current observed value is in fact n s ~ 0.97 (see Tab. 1.1) in 
excellent agreement with the expectations from slow-roll inflation. Other predictions from inflation 
about the nature of perturbation are that perturbations should be Gaussian and adiabatic (see Ch. 4). 

1.6.4. Final remarks 

In this section we have outlined the simplest scenario of inflation and we have shown how it can 
solve a couple of problems arising in the concordance model and how it can set the framework for 
a FLRW universe. How sure can we be that such a scenario really took place in the very early 
universe? A conclusive answer to this question cannot be given. On the one hand, there are a 
couple of robust predictions of the simplest class of inflationary models, but on the other hand, 
by introducing extra parameters and by fine-tuning one can alter these robust predictions and, for 
instance, also produce FLRW universes that are open. While so far the robust predictions of the 
simplest inflationary models seem to be confirmed by observations, the whole underlying physics of 
inflation is very speculative and there might be other reasons for why our universe is very close to 
flat etc. Mukhanov (2005, Sect. 8.6) argues for a proper consideration of the "price-to-performance" 
ratio of inflationary theories in the sense that by an increase of the complexity of the models their 
predictive power is decreasing. The most attractive feature of inflation is certainly its simplicity 
and so for a proper assessment of its usefulness one has to presumably separate its phenomenology 
(i.e. the basic processes and robust predictions) from the underlying physical models (i.e. what fields 
are involved, how did it start, how did it end) which are extremely speculative and which can possibly 
never be confirmed by observations in the future. 

The concept of chaotic inflation - whether true or not - also puts a new complexion to the 
cosmological principle (see Sect. 1.1). It constitutes sort of a mechanism to produce a homogeneous 
and isotropic "patch" within a much larger inhomogeneous and anisotropic chaotic universe. As long 
as this homogeneous and isotropic patch is much larger than the Hubble length (i.e. our effective 
observable universe) we are entirely unaffected by the universe outside this patch and our observable 
universe behaves like a proper FLRW universe. This shows how idealistic and far reaching the 
cosmological principle is if interpreted in a strict sense. In the end we are, in fact, unable to make 
any firm statements about the universe beyond our observable universe and if chaotic inflation is 
taken at face value, our initial assumption of the homogeneous and isotropic universe leads us via 
inflation to the conclusion that (globally) the universe is not homogeneous and isotropic at all. 
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Chapter 




Newtonian theory of structure formation 



The Universe is homogeneous and isotropic on scales larger than 100 Mpc, but on smaller scales we 
observe huge deviations from the mean density in the form of galaxies, galaxy clusters, and the cosmic 
web being made of sheets and filaments of galaxies. How do structures grow in the universe and how 
can we describe them? In this chapter, we develop the Newtonian theory of structure formation and 
introduce the basic statistical equipment for quantifying them. Since we will entirely work within 
the concordance model outlined in Section 1.5, we will stick to a flat geometry. This simplifies our 
treatment enormously insofar as it allows the usage of Fourier transformation to decompose the 
structures of the universe into single independent modes. We prefer to first present the theory of 
Newtonian structure formation before going to the general relativistic theory in the next chapter, 
since the Newtonian approach is not only technically much simpler, but also sufficient to understand 
most of the processes which are well within the horizon. 1 Nevertheless a full analysis of structure 
formation in the universe starting with small perturbations generated during inflation is not possible 
in terms of Newtonian physics and therefore requires a general relativistic treatment. This will be 
done in the next two chapters, where we will use the Newtonian results developed in this chapter to 
interpret the general relativistic results. 

In Section 2.1 we develop the basic equations governing the growth of structures by solving the 
corresponding hydrodynamical equations at linear order. The domain where these equations are 
valid defines the linear regime. In Section 2.2 we introduce the correlation function and the power 
spectrum for a precise and quantitative description of cosmological structures and we explore how 
these statistics evolve with time as the structure grows. Using these statistics we will define the two 
cosmological parameters a$ and n s , which we have already encountered in Section 1.6. Finally, in 
Section 2.3 we introduce approximations to treat the nonlinear growth of cosmic structures in an 
analytic approximative way. This will lead to the concept of the "halo" and the corresponding theories 
about their abundance and statistical distribution in space. We finish this section by presenting the 
"halo model", which is a very simple, but powerful theory for describing the galaxy correlation 
function in the linear and nonlinear regimes. 



lr The relation between Newtonian physics and general relativity in the context of structure formation is discussed in 
Lima et al. (1997) and Noh & Hwang (2006). It is interesting that not only the homogeneous and isotropic world 
models of Chapter 1, but also their linear structures were first studied in the context of general relativity and not 
Newtonian physics. 
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2.1. Linear perturbation theory 

In the framework which was outlined in Section 1.5.2, the LSS that we can see today started with very 
small initial deviations from the homogeneous FLRW model and grew by gravitational instability. At 
epochs when these deviations are very small, they can be treated as perturbations around the smooth 
background, while we keep only terms of first order in perturbation quantities. This is called "linear 
theory" and the regime where this approach is valid is called "linear regime". The corresponding 
Newtonian theory was initially formulated by Bonnor (1957) for an expanding universe. We follow 
mainly the introductions given in Coles & Lucchin (2002, Ch. 10) and Mukhanov (2005, Ch. 6). 

2.1.1. Newtonian hydrodynamics in an expanding universe 

Suppose the universe is filled with an inhomogeneous, dissipationless, ideal fluid with matter density 
p(t,r), velocity field v(t, r), pressure p(t,r), gravitational potential 5>(i, r), and entropy per unit 
mass S(t,r), where t denotes the cosmic time and r physical coordinates. The fluid is governed by 



the basic hydro dynamical equations of Newtonian physics: 

dp 

continuity equation: — + V • (pv) = (2-1) 

dv VjD 

Euler equation: — + (v ■ V) v H + V<I> = (2.2) 

ot p 

Poisson equation: V 2 <3? = AirGp (2.3) 

dS 

conservation of entropy: — — h (v ■ V) S = . (2.4) 



These equations taken together with the equation of state p = p(p, S) form a closed system of 
equations and determine the seven unknown functions p, v, p, <E>, and S. Note that the Eqs. (2.1)- 
(2.4) are only valid for non-relativistic matter, i.e. it must hold \v\ <C c and p <C pc 2 . Since for a 
fluid a unique velocity vector is associated to any point in space, the motion of matter must be in 
the single stream regime, i.e. adjacent particles move in approximately parallel trajectories and do 
not cross. This is a reasonable assumption in the linear regime. Mixing of different streams at the 
same point in space ("shell crossing") does not occur until the structures have grown nonlinearly. 
Moreover, our simple ideal fluid model does not account for diffusion processes (e.g. "free streaming" 
of relativistic particles) which erase small scale perturbations. However, since DM is cold and does 
hardly interact with other particles, we can neglect such effects for the study cold DM. In order to get 
accurate models for structure formation, one would have to solve the general relativistic Boltzmann 
equation taking into account all sorts of energy contributions in the universe and their interactions. 
The corresponding procedure is outlined in Section 3.5.2. 

Unfortunately, the hydrodynamical equations (2.1) are nonlinear and it is very difficult to find 
their general solution. So, assuming that the universe is close to a FLRW universe, we can perturb 
the fluid around its Hubble flow and solve the hydrodynamical equations at first order (or "linear 
order") in the perturbed quantities. Thus we split each quantity into a homogeneous background 
contribution (indicated by a bar) and an inhomogeneous perturbation (indicated by a 5) , where the 
perturbed quantities are small compared to their background: 

pit, r) = p{t) + 5p(t, r) , v(t, r) = v(t, r) + 5v(t, r) , 

p(t,r)=p(t) + 5p(t,r) , $(*,r) = $(t,r) + <S$(t,r), (2.5) 

S(t,r) = S(t) + SS(t,r) . 

Note that since r are physical coordinates rather than comoving, the homogeneous velocity field 
vit, r) and the homogeneous gravitational potential are nonzero and even depend on the physical 
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position. The homogeneous velocity field is given by the Hubble Law 

v(t,r) = H(t)r . (2.6) 

This is easily seen using Eq. (1.13), since the homogeneous matter field is at rest with respect 
to comoving coordinates. The perturbed velocity field Sv is the field of peculiar velocities 
(cf. Sect. 1.4.1). 

How do the background quantities in Eq. (2.5) relate to the general relativistic approach of the 
homogeneous universe in Chapter 1? The hydrodynamical equations for the homogeneous background 
yield 

-p + 3Hp = 0, H + H 2 = -^l, (2.7) 

the first being the continuity equation and the second the divergence of the Euler equation and 
substituting the Poisson equation. The Eqs. (2.7) are the equation of motion (1.23) and the second 
Friedmann equation (1.20) for pressureless matter. Note that the pressue p of the fluid does not enter 
the Friedmann equation in the Newtonian approach and we have already assumed that it is small. 
Thus we are consistent with general relativity and can assume that the hydrodynamical equations 
(2.1)-(2.4) for the homogeneous quantities are satisfied. If one wants to treat fluids with considerable 
amount of pressure in a Newtonian approach, one has to be very careful to avoid inconsistencies (see 
Lima et al. 1997) as discussed in the following paragraph. 

If there are further energy contributions in the universe which do not interact with our fluid 
except by gravitation and if these contributions are homogeneous, e.g. a cosmological constant or a 
homogeneous radiation background not interacting with the matter, these fluids enter our formalism 
only through a homogeneous (i.e. unperturbed) term in the Poisson equation and so do not alter 
our first order perturbation equations. They alter, however, the expansion of the universe. Note 
that such additional fluids could introduce (relativistic) pressure. To reconcile the Newtonian and 
general relativistic approach for our calculation, we just assume that the hydrodynamical equations 
(2.1)-(2.4) for the homogeneous quantities are satisfied (although we are aware that they may not be 
due to relativistic effects) and that the expansion is governed by the relativistic Friedmann equations 
(1.19). 

Substituting the Eqs. (2.5) into the hydrodynamical equations (2.1)-(2.4) and keeping only terms 
to first order in perturbed quantities yields 

continuity equation: + p V • Sv + V • (Sp v) = 

Euler equation: — — + (Sv • V) v + (v • V) Sv + — (c 2 5p + a SS) + V<5$ = 

at p (2-8) 

Poisson equation: V 2 5$ = AttG 5p 

dS 

conservation of entropy: — + (v ■ V) SS = . 

For the Euler equation we have used the expansion l/(p + Sp) ~ 1/p + Sp/p 2 + . . . and substituted 
the equation of state p(p + Sp,S + SS) at first order 

Sp = c 2 s Sp + a SS (2.9) 

with c 2 = (dp/dp)g the square of the speed of sound and a = (dp/dS)p. 

We can further simplify the equations (2.8) by transforming them into the comoving frame. This is 
done by the following transformation from physical coordinates (t, r) to comoving coordinates (t, x): 

1 d d 1 

r = ax, V r = -V x , — =— --v-V x . (2.10) 

a ot „ ot „ a 
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Substituting these expressions into the Eqs. (2.8) and taking the Fourier transform with respect to 
comoving coordinates, i.e. for any perturbed quantity dQ(t,x) holds 2 



SQ(t,k) = [ 5Q(t,x)e- ikx dx 3 , 6Q(t,x) = [ SQ(t,k) 

J (2tt) J 



i ikx dk 3 



(2.11) 



with k the comoving Fourier modes, we obtain the first order hydrodynamical equations for a given 
mode k: 



continuity equation: Sp + 3H5p H ■ Sv = 

a 

% k 

Euler equation: 5v + H5v + — (c?5p + aSS) H 5® = 

ap v ' a 

Poisson equation: k 2 5& + An^Ga 2 bp = 
conservation of entropy: 6S = . 



(2.12) 



Here, k = \k\ and a dot denotes the derivation at fixed k. For the first equation we have used 
V • v = 3H and for the second equation (5v • V) v/a = 5vH, since v = Hr = ax. 



2.1.2. Perturbation modes 

The Eqs. (2.12) are five coupled linear first order differential equations and one algebraic relation, so 
we expect the general solution to be a superposition of five linear independent modes. These can be 
characterized as follows: 



Entropy mode The conservation of entropy allows a static entropy perturbation 

5S(t,k) = 5S{k) (2.13) 

with appropriate 5p, 5v and <5$, so that the other equations are satisfied. Note that entropy pertur- 
bations can only occur in a multi-component fluid (e.g. photon-baryon plasma before recombination). 
Since the matter is dominated by cold DM and this matter does hardly interact with any other energy 
contribution, we will neglect entropy perturbations in our following discussion. Furthermore, entropy 
perturbations are rather "unnatural" insofar as the simplest models of inflation do not predict any 
entropy perturbations (see Sect. 4.2.1). If there are no entropy perturbations at the beginning there 
will be none created due to Eq. (2.13), and preexisting entropy perturbation might even become 
erased (see discussion in Weinberg 2008, Sect. 5.4). To gain insight into the behavior of entropy 
modes, we assume that the universe is static 3 , i.e. H = and p = const. The entropy mode is 
solved by 5v = and bp = const, so density fluctuations do not grow in this mode. Thus entropy 
fluctuations are not interesting in the context of structure formation. 



2 For these integrals to converge, we formally assume that dQ(t, x) = for \x\ > L with the cut off scale L being much 

larger than any other scale of interest so that it does not play any role. 
3 Note that a static homogeneous universe with p / is no solution of the Eqs. (2.1)-(2.4), because in a static universe 

we have v = 0, so that the Euler equation becomes V<5"1 > = 0. Inserting this into the Poisson equation yields p = 

contradicting our assumption. So a homogeneous universe in Newtonian physics must either expand or contract. 

This problem is solved by arbitrary assuming that the Poisson equation holds only for the perturbed quantities. 

This is called the "Jeans swindle" (see e.g. Binney & Tremaine 2008, Sect. 5.2.2). 
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Vortical modes Setting bp = 5® = 5S = and k-dv = 0, the Euler equation becomes 5i)+H5v = 
with the solution 

5v oc - . (2.14) 
a 

In an expanding universe these modes can only decay and so we can neglect them. 

Adiabatic modes We set 5S = and k \\ 5v. Expressing the continuity equation in terms of the 
matter overdensity 

mt)s ^^KM_m ( , 15) 

yields 

5+-Sv = 0. (2.16) 
a 

If we differentiate this equation, we can first eliminate 5v from the Euler equation and then 5v with 
Eq. (2.16). Then eliminating 5& with the aid of the Poisson equation we obtain 



' 2 1,2 

5 + 2HS+[% r - AttGp )5 = 0. 



a? 



(2.17) 



This is a linear second order differential equation and allows two independent solutions, which can 
grow under certain conditions. Note that with these solutions at hand we can immediately compute 
the peculiar velocity field that is generated by the perturbations 5 by means of Eq. (2.16). 

So we have found all modes. The most general solution of the Eqs. (2.12) is a superposition of 
two vortical modes, two adiabatic modes, and one entropy mode. Among these, only the adiabatic 
modes are interesting in the context of structure formation and we will merely focus on these in the 
following. 

2.1.3. Linear growth function 

To gain insight into the dynamics of adiabatic perturbations, we again consider the case of a static 
universe. Eq. (2.17) has then simple analytic solutions for each fc-mode. If the third term in Eq. (2.17) 
is negative, there are exponential growing and decaying solutions, but if it is positive, both solutions 
are oscillating. This is called the Jeans criterion. Only modes whose physical wavelength A is larger 
than the Jeans length Aj can grow, i.e. 

A = lr >* I = fl> ^. (2.18) 
Similarly, we can define the Jeans mass as 

M -sKt) 3 H^- <««> 

Only perturbations which are more massive than the Jeans mass are able to grow. Note that the Jeans 
mass is proportional to the speed of sound c s of the fluid. For baryons the speed of sound is a strong 
function of cosmic time. As long as the baryons are coupled to the photons in the early universe, 
the speed of sound is huge due to the photon pressure, while during recombination it decreases 
dramatically and becomes practically negligible (however see the discussion in Sect. 3.5.1). On the 
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other hand, the speed of sound of DM is basically negligible at all times. So baryonic perturbations 
can grow only after recombination, whereas DM is not constrained to this condition. 

In an expanding universe the behavior of the perturbations is qualitatively similar to that in the 
static universe. Perturbations which are larger than the Jeans length exhibit a growing and the 
decaying mode, while smaller perturbations constitute oscillating sound waves. However, the term 
in Eq. (2.17) which is proportional to H is now nonzero and acts like a "friction" ("Hubble drag") 
counteracting the dynamics of the perturbations. Let us concentrate on Fourier modes which are 
much larger than the Jeans length so that equation (2.17) reduces to 

5 + 2H 6 - AirGp 5 = , (2.20) 

which has the general solution 

S(t, k) = 6+{k) D + (t) + 5-{k) D_(t) , (2.21) 

where the subscript + denotes the growing and — the decaying mode. The functions D + (t) and 
D_(t) are independent real valued (fc-independent) solutions of Eq. (2.20) and <5+(fc) and <5+(fc) 
are complex valued initial conditions. The growing solution D + {t) is usually called linear growth 
function and just denoted by D{t). We adopt this notation and apply the normalization D(to) = 1. 

How fast do perturbations grow in an expanding universe? When the universe is matter dominated, 
the expansion rate is H(t) = 2/(3t) (see Eq. (1.72)), and the time evolution of perturbations is 
given by D + oc t 2 / 3 and D_ oc i -1 . Thus matter perturbations can only grow as a power law 
in the linear regime during matter domination and so structure formation is much more inefficient 
in an expanding universe than in a static one. During radiation domination matter perturbations 
grow even slower. Assuming that the universe is filled with DM and radiation, where radiation is 
assumed to be homogeneous distributed 4 , it can be shown that the growing mode during the whole 
radiation dominated era grows maximally about a factor of 2.5 (Coles & Lucchin 2002, section 10.11). 
This is called the Meszaros effect and says that DM perturbations during radiation domination 
are basically frozen even for perturbations much larger than the Jeans length. During the era 
dominated by the cosmological constant, the perturbations are even entirely frozen (Mukhanov 2005, 
Sect. 6.3.4). So DM perturbations can basically grow only during matter domination, where they 
grow proportional to the scale factor a. 

Unfortunately, Eq. (2.20) does not allow a closed, analytic solution for the concordance cosmology. 
However, the following fitting formula provides a sufficiently accurate approximation for all practical 
purposes at low redshift (Carroll et al. 1992) 5 

D ( Z ) = 1 g f z ) = ^nM (2 22) 

{i + z) 9 W nU 7 (z)-n A (z) + (l + n m (z)/2)(l + n A (z)/^o) , 



Ho ^ 



[H(z) 



(2.23) 



where (cf. Eq. (1.44)) 

r H I 2 

n m (z) = n m (i + zf ^-^yj , n A (z) = n A 

and H{z) is given by Eq. (1.69). 

4 This is, of course, an approximation, since the radiation perturbations are not zero and, if the adiabaticity condition 
holds (see Sect. 4.2.1), they even have the same amplitude outside the horizon as the DM perturbations. However, 
Weinberg (2008, p. 296) showed that well inside the horizon the radiation perturbations are smaller than the DM 
perturbations and so can be neglected for studying the growth of DM perturbations. 

5 For a flat universe and realistic values of fi m and Qa it is more accurate than one percent at any redshift, for which 
the radiation density is negligible. An exact solution for a universe with Q m + J1a = 1 is provided in Mukhanov 
(2005, Eq. (6.67)) in terms of an integral expression. 
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2.1.4. Transfer function 

The considerations in the last section concern only perturbations which are well within the horizon. 
For the dynamics of the perturbations outside the horizon, a general relativistic treatment is required. 
It can be shown (see Eq. (4.63)) that the modes at the time t- m , when they enter the horizon, scale 
like 

1 f a 2 (t in ) (radiation domination) 

[t m , K{t m) ) oc #2 (t . n)a 2 (t . n) \ a ( tin ) ( ma tter domination) , l ^ 4j 

where the size of the perturbation k depends on i; n and for the last step we have used Eq. (1.73) 
and Eq. (1.72) respectively. In this sense we can say that the perturbations outside the horizon grow 
like l/(aH) 2 . Since during matter domination, the perturbations outside and inside the horizon 
grow both proportional to a, perturbations that enter during matter domination are considered as 
"benchmark" to which other perturbations are related. The amplitude of the perturbation at horizon 
entry is called "primordial". The transfer function T(k) is then introduced by 

5(t,k) = 5(t hl ,k)T(k)r^, (2.25) 

where D is the linear growing function that includes matter and a cosmological constant, but no 
radiation. This means that T(k) considers all deviations in the evolution of the primordial pertur- 
bation which happen at early times during radiation domination and due to physical processes well 
within the horizon (e.g. matter-radiation plasma). For a mode k which enters well within the matter 
dominated regime and after decoupling, we have by definition T{k) ~ 1. 

How does the transfer function look like for a DM mode that enters during radiation domina- 
tion? Since DM perturbations cannot grow inside the horizon during radiation domination due to 
the Meszaros effect, modes at smaller scales are suppressed compared to those of larger scales. For 
simplicity, we assume that during radiation domination a mode entering the horizon freezes im- 
mediately and starts to grow like D oc a at matter-radiation equality t eq . Since during radiation 
domination perturbations outside the horizon effectively grow proportional to 1/a 2 (see Eq. 2.25) 
relative to those inside the horizon (which remain constant), the amplitude of a mode entering at 
*in < t cq is suppressed by the factor (a(t- m ) / a(t eq )) 2 . Moreover, the mode entering the horizon satis- 
fies k oc H(ti n )a(ti n ) (see Sect. 1.4.3), so that using Eq. (1.34) it follows a(ti n ) / a(t eq ) = k eq /k, where 
k cq is the mode that enters the horizon at matter-radiation equality i eq . So the transfer function for 
DM has the asymptotical behavior 



( , 1 if k cq /k > 1 

J W-1 {kc J k) 2 iffceq/A; «l. 



(2.26) 



To obtain the precise form for the transfer function at all scales, we would have to solve the general 
relativistic Boltzmann equation taking into account all sorts of energy contributions in the universe 
and their interactions. The corresponding procedure is outlined in Section 3.5.2. For instance, the 
corresponding transfer function Tb(fc) for baryons does also contain effects such as acoustic oscil- 
lations leading to a strong oscillation pattern in 7b(/c). These oscillations are then transferred by 
gravitational interactions to the spatial distribution of DM particles producing a weak oscillation 
pattern in T(k) which is called "baryonic acoustic oscillations". This feature, which has been con- 
vincingly detected (e.g. Percival et al. 2010), is a powerful confirmation of our model of the history 
of the universe and is useful for estimating cosmological parameters. A discussion of common fitting 
formulas for the transfer function is given in Section 6.5 of Weinberg (2008). 
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2.1.5. The nonlinear regime 

So far, we have assumed that 5 < 1 on all scales within the horizon. This condition is satisfied to 
high accuracy in the early matter dominated regime (e.g. 5 ~ 10~ 5 for a mode entering the horizon 
around recombination). At these times, Eq. (2.25) is a good approximation. However, as time goes 
by, 5 grows and there will be a point, where S < 1 and the linear approximation fails to be a good 
approximation. Around this time, nonlinear effects become important. Thus it will prove to be useful 
to divide up the overdensity S at any time into a linear and a nonlinear part 

5(t,k) = 5 lin (t,k) + 5 nl (t,k), (2.27) 

where the linear part 6n n is defined by Eq. (2.25) and the nonlinear part 5 Q \ is defined by Eq. (2.27). 
At early times, 5 n \ is basically zero and 5 ~ 5n n holds at all scales of interest (linear regime). 

What can we say about nonlinear structure formation? Unfortunately, not surprisingly the nonlin- 
ear evolution of the density field is very complicated and there is no analytical formula describing the 
general case. However, for certain special cases, analytic solutions can be found (see e.g. Sect. 2.3.1). 
To deal with the general case, one has to make use of large numerical simulations. This has become 
an extensive field within the branch of astronomy and a review would go beyond the scope of this 
introduction. In general, nonlinear effects mix different fc-modes and lead to a cosmic web which is 
made of sheets and filaments and in whose nodes are big galaxy clusters (see Fig. 1.1). The collectiv- 
ity of this cosmic web along with clusters and groups of galaxies is called the "large-scale structure" 
(LSS). 

2.2. Statistics of the overdensity field 

In principal, the overdensity S(t, x) contains all information about the LSS in the universe at any 
time. However, in order to characterize the structure in the universe and to compare observations of 
5 with theory, it is meaningful to think of 5 as a realization of a stochastic process. We can think of 
it like the initial inhomogeneities in the universe were created by a stochastic process and that this 
process was the same at every position. This lays the theoretical foundation for the cosmological 
principle. 

A possible candidate for such a stochastic process is discussed in Chapter 4. Since such a stochastic 
process is not only constrained to one point but to all space, the mathematical theory needed here 
is the theory of random fields. 6 In the following we regard 5 as a realization of a homogeneous and 
isotropic random field with zero mean. The random field itself will also be denoted by 5. 

2.2.1. Correlation function and power spectrum 

The simplest nontrivial statistics of an inhomogeneous universe is the 2-point correlation function 

£(x,x') = (6(x)8(x')) , (2.28) 

where (. . .) denotes the ensemble average (expectation value) of the stochastic process underlying 
the random field 5. Since our random field is homogeneous and isotropic, the correlation function 
can only depend on \x — x'\, i.e. £(\x — x'\) = £(x,x'). If £(r) is additionally continuous at r = 0, 



For a general introduction into the theory of random fields see, for example, Miller (1975), Adler (1981), or Adler & 
Taylor (2007). 
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there exists a spectral representation of the field, i.e. we can decompose it in Fourier modes S(k) and 
it holds (spectral representation theorem) 7 

(6(k)S*{k')) = (2irf 5 D (k - k') V(k) , V{k) = j ^{x) e - ikx dx 3 . (2.29) 

The function V(k) is called power spectrum. Due to the reality of £(r) it holds V(k) = V*(—k) and 
due to the rotational invariance of £(r) it holds V(k) = V(k), so V(k) is a real function. Moreover, it 
is also nonnegative for all k. The power spectrum contains exactly the same amount of information 
as the correlation function, but depending on the application it may, however, be useful to prefer one 
or the other (e.g. Feldman et al. 1994, Sect. 1). 

A widely used kind of random fields are the Gaussian random fields being the simplest and most 
natural. A homogeneous real Gaussian random field 8 with zero mean is fully characterized by its 
finite dimensional distributions. That is for N points in space x±...xn the probability density 
function is a multivariate Gaussian 

f(S( Xl ),...,S(x N )) = (27r)jV/2 ^^y exp ^-i 8{xi) (V-% S{xj^j (2.30) 

with the covariance matrix given by = £(|ajj — Xj\). Hence the correlation function £(r) determines 
the random field entirely. 

The Fourier transform S(k) of a Gaussian random field has for each fc-mode a real and imaginary 
part, which are independent and Gaussian distributed with zero mean and variance V(k)/2. This 
is equivalent for 5(k) having a uniformly distributed random phase and a modulus \5(k)\ which is 
Rayleigh distributed with variance V(k). Additionally, each fc-mode is independent from the others. 

By introducing the ensemble average (. . .) we referred to a stochastic process taking place in the 
early universe. However, the observable LSS constitutes a single realisation of this process, so the 
question arises how these ensemble averages might be measured in practice. To be able to infer 
anything about the underlying stochastic process one has to postulate some sort of "ergodicity" or 
"fair sample hypothesis". Ergodicity refers to the mathematical property of random fields that 
volume averages converge to ensemble averages as the survey volume goes to infinity. In general 
it is hard to prove that a random field has this property. However, it can be shown that a zero 
mean, homogeneous, real Gaussian random field is ergodic if £(r) — > for r — > oo (Adler 1981, 
Thm. 6.5.4). For a general random field, ergodicity is a valid assumption if the length scale over 
which the average is computed is large enough, so that the spatial correlation become negligibly 
small (see Weinberg 2008, App. D). On the other hand, the fair sample hypothesis (Peebles 1980) 
states that well separated parts of the universe can be regarded as independent realizations of the 
underlying stochastic process and that the observable universe contains many such realizations. 

While ergodicity is a precise mathematical statement which may or may not apply to a given 
random field, the fair sample hypothesis is more vague. However, Watts Sz Coles (2003) pointed out 
that the fair sample hypothesis is stronger than ergodicity and probably more useful for studying 
the LSS, because to obtain a fair sample it is not necessary to average over an infinite volume, which 
is practically impossible. Whichever hypothesis finally applies, most present day galaxy surveys are 
way too small to constitute a fair sample (especially at high redshift) and thus averages over the 
volumes of such surveys are subjected to statistical fluctuations. This phenomenon is called sample 
variance or cosmic variance if the sample is constrained by the size of the observable universe 
(e.g. CMB). The two terms are, however, often used interchangeably. 



7 See e.g. Adler (1981) Theorem 2.4.1 together with Theorem 2.2.1. 
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2.2.2. Initial conditions and linear power spectrum 

Even in the absence of any mechanism producing perturbations in the early universe and well before 
the idea of inflation, there was a preferred ansatz for the initial power spectrum of the form (Harrison 
1970; Zel'Dovich & Novikov 1970; Peebles & Yu 1970) 

T(k) oc k ns (2.31) 

with n s the spectral index. The virtue of this ansatz is that it does not introduce any particular 
length scale. If n s = 1 then this spectrum is called the Harrison-Zeldovich spectrum and has the 
preference of being scale invariant (see Sect. 4.3), which is a natural expectations. In any case, there 
are fairly general reasons to constrain the spectral index within —3 < n s < 4 (see e.g. Peacock 1999, 
Sect. 16.2). A further assumption about the initial perturbation is that it constitutes a realization 
of a Gaussian random field which, again, is probably the simplest and most natural choice. 

Today we are in the favorable situation that both of these assumptions could be verified to a high 
degree by observations of the CMB (e.g. Komatsu et al. 2011) and that we also have a theory at 
hand which explains how this initial state could have been produced. As we will show in Chapter 4, 
the simplest model of inflation being governed by a single inflaton field produces an initial density 
field which can be regarded as a realization of a Gaussian random field whose power spectrum for 
modes entering the horizon during matter domination is (see Sect. 4.3) 

V(t, k)(xk n % n s = 1 - 6e + 2n , (2.32) 

where e and rj are the slow-roll parameters (see Eq. (1.83)). 

Similar to Eq. (2.27) it is reasonable to split up the power spectrum (and likewise the correlation 
function) into a linear and a nonlinear part 

V(t, k) = Vl in {t, k) + V n l(t, k) , (2.33) 

where the linear power spectrum is just the power spectrum of the linear overdensity field Sn n and is 
given for any time after t eq by (see Eqs. (2.25) and (2.31)) 



V ]in (t,k) = A T 2 (k) D\t) 



(2.34) 



with Ao 8 its normalization at the present time to (recall that we set D(to) = 1). The nonlinear part 
V n \ is then defined by the difference between the total and the linear power spectrum. 

The linear power spectrum for the concordance model together with a compilation of measurements 
is shown in Figure 2.1 At large scales the power spectrum is basically given by the primordial power 
spectrum P(k) oc k and on small scales it is affected by the physical processes inside the horizon 
(e.g. Meszaros effect) so that P(k) oc k~ 3 as expected by the asymptotical behavior of the transfer 
function (2.26). Obviously, the stagnation of the growth of perturbation during radiation domination 
introduces a distinct length scale into the linear power spectrum, which separates these two regimes 
and which is of the size of the horizon at radiation-matter equality. The overlap of the different 
measurements shown in Figure 2.1 impressively demonstrates the success and the consistency of the 
paradigm of structure formation in the linear regime. 

8 Since k is not dimensionless, we have to be careful in interpreting Eq. (2.34). The expression k n " = e"*' 11 ''' is not 
well defined in general, as we cannot build the logarithm for a dimensioned quantity. To make sense of fc" s we 
have to introduce a reference mode feo (e.g. ko = 1 Mpc -1 ) to scale out the unit of k as (fc/fco)" a . Therefore the 
normalization Ao carries the full dimension of the power spectrum and depends on the adopted reference mode ko- 
For this reason, a numerical value of Aq always has to be stated for a corresponding reference mode ko . 
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Wavelength A [h -1 Mpc] 
10 4 1000 100 10 1 




Wavenumber k [h/Mpc] 

Figure 2.1. Linear power spectrum at the present epoch. The solid line is the model of 
the linear power spectrum for the concordance model and the points with errorbars show the 
different measurements as indicated in the legend. The methods by which these measurements 
were maped onto /c-space are explained in Tegmark & Zaldarriaga (2002). It is obvious that 
it holds on large scales P(k) oc k and on small scales P(k) oc &~ 3 . The turnaround roughly 
marks the scale of the horizon at the epoch of radiation-matter equality. (Taken from Tegmark 
& Zaldarriaga 2002, Copyright (2002) by The American Physical Society.) 



The total power spectrum is equal to the linear power spectrum if S\i n <C 1. The contribution due 
to V n \ is practically negligible at early times and becomes more important as perturbations grow. 
However, since the primordial power spectrum is a power law on large scales, there is for any time 
t a threshold fc t h(t) so that for modes with k < k t h(t) the overdensity 5 is so small that we can 
assume V(t, k) ~ Vu n (t, k) (linear regime). The regime with k > k t h(t), where V(t, k) ~ V n \, is called 
the nonlinear regime. Note that nonlinear effects not only mix different fc-modes, but also spoil 
Gaussianity 9 . 



A simple way to see this is by noting that by definition 6 > — 1, while on the positive side there is no such constraint, 
so the distribution function of 8 becomes asymmetric and cannot be Gaussian anymore. 
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Sometimes it is convenient to express the linear power spectrum by means of an effective spectral 
index n c g- defined as 

^-<«^ 

such that 

k n oB (k) = f,n sT 2^ ^ggj 

With Eq. (2.26) we obtain immediately n e g(k) ~ n s for k » fc eq and n e s(k) ~ n s — 4 for k <C k eq . 

2.2.3. Filtering and moments 

In the real universe, the overdensity field S has powers on all scales. Particularly on very small scales, 
deviations of the matter density from the mean density can become huge. For instance, if we apply 
5 = (p — p)jp naively to the Earth, we obtain an overdensity of about 10 30 for every position inside 
the Earth, but this huge overdensity has no meaning for cosmology and makes it meaningless to 
search the overdensity field for peaks with cosmological relevance. Hence, an important concept in 
cosmology is filtering, where contributions to the density field below a given length scale are filtered 
out. Mathematically, this is obtained by convolving the overdensity with some window function 
W(r), i.e. 

S Rf (t,x) = (S*W)(t,x) = J 5{t,x-x')W{\x'\,Ri)dx' 3 , (2.37) 

where the window function W is associated with a comoving length scale Rf beyond which it is 
essentially zero and is normalized for all Rf such that J W(\x\, Rf)dx 3 = Air j W(r, Rf)r 2 dr = 1. So 
the filtered overdensity 6r { (t, x) is the overdensity smoothed at every position over a scale of Rf and 
features that are smaller than this length scale are washed out. The most common window function 
in cosmology is the top-hat filter with radius i?f 

w , R1 / 3/(47ri? f 3 ) ifr<i? f 

W TH (r,Rf) = { Q {ir>R{ (2.38) 

with its Fourier transform 

W T]1 (k, Rf) = T^Jy ( sm(kRf) - kR t cos(kRf)^j . (2.39) 

We will stick to this case throughout this introduction. 

In the following we are interested in the statistical moments of the filtered linear overdensity field. 
Since the linear overdensity e>n n has zero mean, we obtain for the first moment immediately 

(6k (t, x)) = J (5 Vm (t, x - x')) W{x', Rf) dx' 3 = . (2.40) 

The second moment, however, is nontrivial and can be expressed by the linear power spectrum as 

4 f W = (Skfr*)) = (S R{ (t,x)5* R{ (t,x)) = ((J- 1 ^-) (J- 1 ^.)*) (2.41) 

= (2^ / / W*'*)^*'*')) W(k,Rf)W*(k',Rf) e^- fe > dk 3 dk' 3 (2.42) 



{2n) , , V\in(k) \W(k, Rf)\ dk A , (2.43) 
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where we have denoted the Fourier transformation operator by T and we have applied the definition 
of the power spectrum (2.29) to the linear overdensity field. If the /^-dependence of W(k, Rf) appears 
only in the combination kRf (cf. Eq. 2.38), we can define W(kRf) = W(k,R{) so that 



^) = ^^ j k n ^ k)+2 \W{k Rl 



i)\ z dk 

A.D 2 (t) f 1 / v \ n cs(y/R{)+2 ( 2 - 44 ) 
A ° D{t) f 1 \W{y)?dy, 



2vr 2 J Rf \Rf 

where we used the explicit form of the linear power spectrum (2.34) with the effective spectral index 
(2.35). If we scale out i? f ( ra ° ff+3 ) with n c g evaluated at 2n/Rf, the integral becomes approximately 
independent of Rf in the range 0.8 Mpc < Rf < 40 Mpc, so that it holds for this range 10 

a R( (t) oc D{t) #7 (neff+3)/2 . (2.45) 

For certain applications it is convenient to express the filtered density field in terms of the mass 
involved. We define the typical mass associated to each point of SR ( (t, x) by M(Rf) = poV(Rf) 
with the comoving volume V(Rf) = AirRf/S. Since Rf and M are uniquely related to each other, 
we will use the notations (5r { , cr,R f ) and (5m, &m) interchangeably. With Eq. (2.45) it follows the 
approximation 



a M {t) oc D(t) M-( n « ff+3 )/ 6 (2.46) 

for masses in the range 10 11 M & < M < 10 16 M & and n e fr evaluated at the corresponding scale 
2vr/i? f . 

For a top hat filter Wth with radius Rf = 8/i -1 Mpc, (7r { (to) for the present time to is denoted by 
o"8 being a cosmological parameter. This parameter fixes the normalization of the linear power 
spectrum (2.34) as 




sin(fc-Rf) — kRf cos(/ciV ~ 
(kRfT 



dk , (2.47) 



with Rf = 8 hr x Mpc. The physical meaning of ag is how much the mass within boxes of Rf = 8 hr 1 
fluctuates from one place to another in the present day universe. Since at the present time the scale 
8 ft. -1 Mpc is already mildly nonlinear and since ag considers only the linear overdensity field, as is a 
rather abstract quantity from an observational point of view and cannot be estimated, for instance, 
by counting galaxies in boxes of 8 h~ l Mpc. As we shall see in Section 2.3.2, ag is very sensitive to 
the number density of clusters in the universe. The current estimated value is (see Tab. 1.1) 

a 8 ~ 0.8 , (2.48) 

although different methods such as measuring number density of galaxy clusters and weak lensing 
surveys typically yield slightly different values for ug probably owing to systematic errors inherent in 
these methods. 



10 For a top-hat filter the integral changes about a factor of 2 over the indicated range. For simplicity we will assume 
the integral to be roughly constant and use it only for order of magnitude calculations. 
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2.3. Dark matter halos 

In this section we discuss how the linear theory that has been developed in the last two sections can 
be related to the complex nonlinear evolution of the LSS. First, we present the spherical top hat 
model as a simple model for the formation of DM halos, then we try to gain insight into the statistics 
of the these halos by computing approximative expressions for their number density and spatial 
correlations. Finally, we introduce the halo model which is a simple approach to understanding the 
behavior of the galaxy correlation function in the linear and nonlinear regime. 



2.3.1. Spherical top hat collapse 

Suppose a small spherical, homogeneous perturbation that is imbedded in a homogeneous background 
universe. Since the present day structures grew from tiny inhomogeneities produced in the very early 
universe, we require that the perturbation at early times approaches the density of the background 
and expands in accordance with it. In this section we investigate how the overdensity of such a 
perturbation evolves with time. 

For simplicity we assume that the background universe with density pit) is flat and matter dom- 
inated and that our perturbation with density p(t) > p(t) and radius R p (t) is symetrically placed 
within a spherical cavity that expands with the background. The radius R(t) of the cavity is chosen 
such that p(t)Rp(t) = p(t)R 3 (t), i.e. if the mass of the overdensity was uniformly distributed within 
the cavity, it would just approach the density of the background. With these (simplistic) assumptions 
the evolution of the overdensity 5(t) = (p(t) — p(t))/p(t) can be easily computed analytically. Our 
requirements on the initial conditions translate to p(t) ~ p(t) and R p (t) ~ R(t) for small t. 

The condition p(t)R 3 (t) = p(t)R 3 (t) guarantees that the background expands undisturbed by the 
perturbation due to the Newtonian theorem for a spherical mass distribution. 11 Moreover, since 
the background is distributed spherically symmetric around the perturbation, the perturbation also 
evolves independently from the background. Thus the perturbation and the background are entirely 
decoupled. 12 The dynamics of the background are then simply determined by the familiar Friedmann 
equation (1.19) for K = 

da \ 2 8ttG 9 , . 

it) =-3-*" (2 - 49) 

with the solution a(t) oc t 2 ! 3 (see Eq. (1.33)). Inserting this solution into the Friedmann equation we 
obtain the explicit time dependence of the background density as 

m = ^ . (2.50) 

How can we describe the dynamics of the perturbation? If the radius of the perturbation was 
continuously increased until the whole universe was covered by the perturbation, we would simply 
have an overcritical FLRW world model, i.e. a universe with K = 1 that is also described by the 
Friedmann equation (1.19). However, due to the Newtonian theorem for spherical symmetric mass 
distributions, our perturbation does not know whether it is imbedded within a critical or overcritical 
background universe as long as the matter is distributed spherically around it. So the perturbation 



11 This condition is, of course, only adopted for simplicity to obtain exact mathematical results. The case of a more 
general perturbation is treated, for instance, in Mukhanov (2005, Sect. 6.4.1). The corresponding behavior of the 
background is identical with our case for large distances from the perturbation. 

12 This remains valid even in a general relativistic treatment due to the Birkhoff theorem and the fact that the back- 
ground and the perturbation are not overlapping (see e.g. Weinberg 2008, Sect. 8.2). 
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must obey a scaled down version of the overcritical Friedmann equation. In the following we will 
study the expanding solutions for such a universe and then apply it to our perturbation. 

Expressed in terms of the cosmological world radius R(t) = Roa(t) (see Sect. 1.2.1) the Friedmann 
equation for a closed universe is 



dRY 8vrG „ 2 2 



dt , . PR' - C . (2.51) 

To find an analytic solution, we express this equation in terms of conformal time 13 

dr = -| dt . (2.52) 
R 

Note that there is a one-to-one correspondence between t and r, whereby these two time coordinates 
have the same zero point. The Friedmann equation (2.51) then becomes 

d R \ 2 „R ( R x 



2 — ) (2.53) 

n (It Rjf J Rjf V R* 

with the constant 

_ 4irG Po R$ _ 1 c n m 

K *~ 3 C 2 " 2 H (n m -if/2- {2 - b4) 

To derive the expressions for R* we used p(t)R 3 (t) = PqRq and 

* = ibn=T' (2 ' 55) 

which is obtained by means of Eqs. (1.40) and (2.51) for an arbitrary reference time to. Equation 
(2.53) has the simple solution 

R(t) = R,(1- cos(r)) , t(r) = f dr' = — (r - sin(r)) . (2.56) 

Jo c c 

What can we say about the initial conditions? Obviously, the evolution of the closed universe 
is entirely determined by i?*, so we have one degree of freedom. Similar to the flat universe (see 
Sect. 1.3.2), the Hubble parameter is determined by the choice of the zero point R(0) = 0, i.e. H(t) = 
R(t)/R(t) = sin(r)/(l — cos(r)) is merely a function of r. The free parameter R* can be fixed, for 
instance, by specifying Rq, po, or J7 m at an arbitrary reference time to- Moreover, for small r, we 
can expand the Eqs. (2.56) to the first nonvanishing order in r yielding 

R(r)^Rj^ t(T )~^. (2.57) 



Inserting t(r) into R(t) leads to 

, 2/3 ^_m = i 

R(t) St 



R^-RonUU-Hot) , H(t) = 



(2.58) 



where we used the Eqs. (2.54) and (2.55). After dividing R(t) by R$, these expressions are identical 
to those of a flat, matter dominated universe (see Eqs. (1.33) and (1-34) for w = 0) up to the factor 



1 In this formulation, the conformal time is dimensionless in contrast to Eq. (1.8). The difference between these two 
formulations is, of course, just the constant factor Rq. 
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Table 2.1. Summary of the different stages of the spherical top hat model. 



Stage r 


t 


<5lin 


(5 + 1 


Turnaround it 


R* 

TT 

C 


A( 6vr )2/3 „ L1 

20 v ; 


% - 5 - 55 


Virialization 2tt 


2tt^* 
c 


l(12vr) 2 / 3 ~ 1.69 


|(2tt) 2 ~ 178 



fim 3 - However, since f2 m (i) — > 1 for i — > 14 , we have $7 m = fi m (io) ~ 1 for small to so that any- 
closed, matter dominated universe is indistinguishable from a flat, matter dominated universe at 
early times. That is for any choice of f2 m , our perturbation approaches the density of the background 
at early times and expands in accordance with it as required. 

Our perturbation, however, has another degree of freedom being its mass M. For a given po and 
mass M, the scale factor R(t) is uniquely determined and the radius of the perturbation is given by 
Rp(to) = (3M/(47rpo)) 1 ^ 3 - Since R p (t) must evolve in proportion to R(t), we introduce a constant C 
such that R v {t) = CR(t). The only constraint on M is that it must be smaller than the total mass 
within our overcritical universe, which formally leads to C < tt, since ttR(t) is half the circumference 
of that universe at time r. With the Eqs. (2.56) and (2.54) the dynamics of the perturbation are 
then given by 

R p (t) = i?* (1 - cos(r)) , it = CR* = ^ . (2.59) 

KJ C 

Using the explicit expressions (2.56), (2.59), and (2.50) for the radius of the perturbation, the con- 
formal time, and the density of the background, respectively, we are able to compute the overdensity 
of a perturbation with mass M as 



P(r) = ( M \ f 1 \ = 9 (r-sin(r)) 2 



5{T) + l = W) = ' \^Gm) = 2(l-cos(r)) 3 • (2 - 60) 



This equation is exact within our simplistic picture and describes the full nonlinear growth of our 
spherical overdensity. Moreover, it even remains exact in the general relativistic framework and is 
valid inside and outside the horizon. Note that 5(r) is independent of the mass of the perturbation 
and becomes zero for small conformal times as expected (cf. Eq. (2.61)). In the following we will 
take a closer look at several states of the evolution of <5(r). The results are summarized in Figure 2.2 
and Table 2.1. 

Linear regime At early times, t < 1, we can expand Eq. (2.60) to the first nonvanishing order in 
r yielding 

S(t) * ^r 2 , (2.61) 



This can be seen as follows: For a universe that started from a big bang and that contains at least one energy 
contribution / with an equation of state wi > —1/3, it holds for early enough times that the second term on 
the right hand side of the Friedmann equation (1.19) always becomes negligible relative to the first term due to 
Eq. (1.27). With the definition of the density parameters (1.40) it follows immediately Qi(t) ~ 1 at these times. 
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Figure 2.2. Evolution of the spherical top hat perturbation as a function of the (dimension- 
less) conformal time r. The growth of the radius R P (t) (in units of i?*) and the corresponding 
overdensity 5(t) of the spherical perturbation are shown by the solid line in the upper and 
lower panels, respectively. At the beginning, Rp(t) grows in accordance with the expansion of 
the background universe R(t) (dashed-dotted line), decouples from the background at r = tt 
(turnaround), and finally collapses. If perfect sphercial symmetry was established, the overden- 
sity would actually collapse into a singularity at r = 2tt (dashed line). However, since perfectly 
symmetric overdensities do not exist in reality, the overdensity virializes at r = 2tt with a 
radius that is about half of its maximal extension. In the lower panel the linear overdensity 
8\m(t) (dashed-dotted line) is shown for comparison. 



so that by eliminating r using Eq. (2.57) we obtain 

Not surprisingly, we recover the relation 6(t) oc t 2 / 3 from the linear perturbation theory for a matter 
dominated universe since the overdensity 5(t) is small at early times. We denote the linear density 

by S]in(t). 

Turnaround As time passes, the perturbation grows and leaves the linear regime. Eq. (2.59) shows 
that for r max = tt the radius i? max = i? p (r max ) finally becomes maximal and the perturbation stops 
expanding. This state is called "turnaround" and marks the epoch when the perturbation decouples 
entirely from the Hubble flow of the homogeneous background. The overdensity at this stage is 
<Kr max ) - 4.55. 
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Virialization After the turnaround the perturbation starts contracting. For a perfect spherical 
symmetry and perfect pressureless matter, the pertubation would collapse to a single point for r co u = 
2ir becoming infintely dense. However, there is hardly any perfect spherical symmetric overdensity in 
the universe, so the perturbation does not collapse to a single point, but rather extreme shell crossing 
occurs and finally a virialized object of a certain finite size is formed which is called halo. 15 To find the 
size of the halo, we search for the radius i? v ir for which the virial condition 2i?km(i? v i r )+£ , P ot(-Rvir) = 
is satisfied. S ince cit the turnaround the kinetic energy is zero, it holds -£^pot(^max 

) = E tot , and since 

the potential energy of a homogeneous sphere of mass M is 

Epot{R) = _ !nr ' (2 - 63) 

we obtain at the radius i? max /2 

TP f -^max \ _ p TP f ^ maX | — TP (JD \ TP I 

-C'kin I — ^ — J ~~ tot ~~ P ot I — 2 — / ~~ ^Poti-KmaxJ — ^pot I — ^ 



- "o^pot 



(2.64) 



as 2E pot (R) = E po t(R/2). This is exactly the virial relation and so we can set i? v ir = -Rmax/2. 

What is the epoch of virialization? Following Eq. (2.59) the conformal time when i? max /2 is 
reached is 3ir/2, however Eq. (2.59) considers only the single stream limit without any crossing of 
trajectories. So virialization takes some additional time and usually the time r v ; r = 2ir is assumed, 
i.e. the epoch when the perfect symmetric perturbation would have collapsed to a point. So, to 
compute the overdensity <5 v ; r of a virialized halo, we evaluate Eq. (2.60) at i? v i r = i? p (37r/2) and 

^vir = t(2ir) 



_ 9 (r-sin(r)) 

^vir — &vir ~\~ 1 — 



2^ = ^(2vr) 2 ~ 178. (2.65) 



2 n ^o^\\3 | 

=3?r/2 

Numerical simulations of collapsing halos show that the choice of t v - 1T = 2t max is of the right order of 
magnitude. For a cluster to reach equilibrium one typically finds t — 3t m ax 

(Coles & Lucchin 2002, 

Sect. 14.1), but we can assume that already for i v i r the density S of the halo is of the right order. 
Like Eq. (2.60), 5 v i r does not depend on the mass of the perturbation and so when we observe an 
overdensity of the order of J v i r , we can assume that the structure is virialized or close to virialization 
irrespective of its mass. 

In the same time the perturbation has grown to an overdensity of 5 v i r , the linear overdensity has 
grown to 

S c = 5 hn {t vir ) ~ 1.69 (2.66) 

being also independent of the mass and the size of the perturbation, since after inserting Eq. (2.59) 
into (2.62) the mass cancels out for any conformal time. So we may postulate that whenever the 
linear overdensity of a perturbation exceeds the threshold 5 C , a halo of overdensity <5 v ir has formed at 
this place. This is in fact a very simplistic approach, but will prove to be astonishingly successful. 

How is our simple picture affected by the presence of a cosmological constant? Eke et al. (1996) 
give an analytical prescription for the case f2 m + = 1 and show that, while S c is rather insensitive 



15 For a DM perturbation there is no interaction between the DM particles (except of gravity) and so there are no 
collisions between them, in contrast to a baryonic gas. Thus the exchange of momentum between particles can only 
occur through gravitation as each DM particle moves in the fluctuating gravitational field of all other particles. This 
process is called "violent relaxation" and it can be shown that it leads to a Maxwellian distribution of velocities 
(Peacock 1999, Sect. 17.1). 
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of the presence of a cosmological constant, A v j r decreases for increasing Q\. Furthermore, very small 
initial perturbations do not collapse any more due to the repelling force of the cosmological constant 
(see also Weinberg 2008, Sect. 8.2). Bryan & Norman (1998) provide a fitting formula for the A v ; r 
of Eke et al. (1996) as 

A vir (Q A ) = 18tt - 82 Q A + 39 n 2 A . (2.67) 
For Qa = we recover the value from Eq. (2.65) and for 17a = 0.7 we find A v i r ~ 101. 



2.3.2. Press-Schechter theory 



In the previous section we have encountered the simplistic concept where a halo forms at a given 
position in space, whenever the linear overdensity field reaches a certain threshold 5 C . We will use 
this concept to estimate the mean number density of halos in the universe at a given time. In 
the following, the number density of halos of mass M at time t is denoted by n^x^M, t) and the 
corresponding mean density by n-h(M, t) = (n^x, M, t)). For ease of notation, we will often suppress 
the time dependence. We mainly follow the presentation in Section 16.3 of Longair (2008). 

Suppose the filtered density field smoothed over a certain mass M is 5m (see Sect. 2.2.3). The 
main postulate of the Press-Schechter approach is the assumption that if SM{t,x) for a given point 
x is larger than a certain threshold 5 C this point is contained within a halo of mass >M. As it holds 
5m (t, r) — > for M — > oo, we will always find a mass M > M such that 5j^(t, r) = 5 C and this is the 
mass associated to the halo at the position x. Since 5m (t,x) is a zero mean Gaussian random field 
with standard deviation cjm, the probability that at a random point 5m exceeds this threshold 5 C is 



p{<?m) 



2iraM 



f 



exp 



2 ^M 



dx 



1 



(2.68) 



where v 



5 C /&M and the error function erf(x) = 2/y^r J Q e~ y dy. Since a halo of mass M has 
effectively swept up the mass within a comoving volume V(M) = poM, we will consider 5 C at random 
positions x^, i = 1,2,... such that their associated volumes V(M) do not overlap. The fraction of 
such points with 5M(t, x i) > 5 C for masses within [M,M + dM] is then simply (dp/ dM){<TM)dM . 
The mass function (or "multiplicity function"), which is the mean number of halos of mass M per 
unit comoving volume and unit mass, is then given by 



dn^ 
dM 



(M) 



dp 



V(M) dM 



t po dp da M 
' Mdo M dM 



2^ po daM v e _y2 / 2 



ttM(j m dM 



(2.69) 



where n^M) = (nh(x,M)) is the mean comoving number density of halos of mass M. Note that 
we have multiplied the whole expression by an additional factor of 2 in order to be consistent with 
simulations (see the discussion below). The formula (2.69) was first derived by Press & Schechter 
(1974) and is therefore called the "Press-Schechter mass function". To roughly obtain its explicit 
mass dependence we use the scaling relation (2.46) so that 



3+w cff 

A*(t)M 6 



(2.70) 



V2 V2a M (t) 

where A*(t) oc -D _1 (i) is a time dependent normalization factor and n e ff(M) the effective spectral 
index evaluated at the corresponding mass M. The Press-Schechter mass function (2.69) can then 
be approximately expressed as 



dn^ 
dM 



(.V)-^#|^M^ex P (-^ W M-) 



7T 



(2.71) 
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with j(M) = 1 + n e g(M)/3. Thus the mass function at a given time is proportional to po oc VL m h? 
and approximately scales like a power law in mass with an exponential cut off at the high mass end. 

What can we learn from this analysis? It became clear that the Press-Schechter approach cannot 
be regarded as a "rigorous derivation". It is not only based on very simplistic assumptions, but 
also needs certain ad-hoc modifications, such as the multiplication by an additional factor of 2. The 
initial assumptions do not only neglect to large part the nonlinear evolution of the density field, they 
also suffer from at least two major drawbacks: First, being based on the spherical top hat model 
we assumed that the collapse of the DM halos is a spherical symmetric process, while in reality the 
halos can have complicated three-axial shapes. Second, there is the "peaks-within-peaks problem" 
asserting that the Press-Schechter approach does not take into account whether a halo of a certain 
mass is included in a halo of some larger mass. Despite these caveats, the comparison between the 
Press-Schechter approach and numerical simulations showed that the former gives roughly the right 
shape of the mass function and is correct up to an order of magnitude (see Fig. 2.3). In particular, 
it shows the exponential dependence on cjm (and thus on erg) at the high mass end. 

Since the publication of the Press-Schechter mass function there has been great effort to deal with 
the previously mentioned caveats. For reviews of the most important improvements we refer, for 
example, to Zentner (2007) and Mo et al. (2010, Sect. 7.2). Today, one typically uses simulation 
calibrated formulas or fitting formulas derived from simulations (e.g. Sheth & Tormen 1999; Jenkins 
et al. 2001; Sheth et al. 2001; Reed et al. 2003; Warren et al. 2006; Tinker et al. 2008; Pillepich 
et al. 2010). In earlier work it was suggested that there might be a "universal mass function" 
(i.e. same functional form and numerical parameters) for different cosmologies and over a broad 
range of redshift. However, most recent studies have shown that if one aims at a precision of < 5% 
such a universal mass function cannot be found, neither for different cosmologies nor for a broad 
redshift range. For a ACDM-cosmolgy and for the redshift range z < 1, fitting formulas at a 
precision of a few percent in the mass range relevant for cosmological studies of the LSS are provided 
by Tinker et al. (2008) and Pillepich et al. (2010). These accuracies should, however, be taken with 
with caution. Uncertainties in the halo mass function are not only introduced by the definition of 
halos in simulations (see e.g. White 2001 and More et al. 2011 for a discussion), but also by effects 
due to the gas physics of the baryon matter, which may cause deviations in the mass function of the 
order of 30% (Stanek et al. 2009). 

2.3.3. Linear bias 

The Press-Schechter approach does not only open the door for an analytic calculation of the mean 
number density of DM halos, it also allows insight into how these DM halos are correlated in space. 
This leads to the concept of "bias" (Kaiser 1984). 

In a first step we express the auto-correlation function of halos of mass M in terms of their comoving 
number density iih{x, M). With the overdensity of halos of mass M (cf. Eq. (2.15)) 



where n-h(M) = (n^x, M)), the corresponding halo auto-correlation function is (cf. Eq. (2.28)) 



5 h (x,M) 



n h {x,M)-n h (M) 
n h (M) 



(2.72) 



U(r,M) = (fi b (x,M)5 h (x',M)) = 



(n h {x,M)n h (x',M)} 
n h (M)2 



-1, 



(2.73) 



where r = \x' — x \ . This formulation has the straightforward interpretation of the correlation function 
as a measure of the excess of halo-pairs at separations r compared to the mean number density of 
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Figure 2.3. Mean mass density p^{M) = Mn^M) of halos for two different redshifts. The 
mass density ph(M) instead of the mass function dn/dM is shown for clarity, as it allows to 
squeeze the y-axis. The black lines correspond to the Press-Schechter mass function (2.69) and 
the red lines to the fitting formula of Pillepich et al. (2010). The solid lines refer to z = and 
the dashed lines to z = 1. For the relation v{M) a concordance cosmology was assumed and 
the transfer function T(k) was taken from Eisenstein & Hu (1999). 



the halos. In the following we try to relate the halo correlation function £hh to the linear correlation 
function £ij n , which is the Fourier transform of the linear power spectrum (2.34). 

The Press-Schechter mass function gives us the mean number density of halos, but it does 
not tell us how it varies from place to place or how it depends on its cosmic environment. The 
probably simplest way to show how the local number of halos depends on the environment is the 
Peak-Background split (Bardeen et al. 1986; Cole & Kaiser 1989; Mo & White 1996). Suppose 
we have an overdensity field 8{x) that can be decomposed into a short wavelength part <5h and into 
a long wavelength background part <5b such that 

5 = 6 h + 5 h . (2.74) 

The short wavelength perturbation 5^ is the progenitor of the halos we want to study and the long 
wavelength perturbation <5b plays the role of a smooth background density being in the linear regime, 
i.e. it holds 5^ <C 1. We will assume that 5^ is essentially constant over the region where 5^ collapses 
into the halos. The effect of 5^ is to perturb the critical threshold that the linear part of <5h has to 
reach for a collapse. If the linear part of Sh reaches the effective threshold 

<5 C = 5 C - <5 b , (2.75) 

the linear part of the total perturbation 5 reaches the actual threshold 5 C that is needed for a 
structure to collapse. The effective threshold 5 C depends on the linear background field <5b and causes 
the fluctuation of a given strength to collapse at different places at slightly different times. This 
causes the local number density to vary from place to place depending on 5^. 
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A quantitative estimation of this effect can be gained by using the explicit form of the Press- 
Schechter mass function (2.69) 



dM 



(M, S c ) <x v(5 c )e 



Since (5b 1, we can expand to first order after <5b yielding 



dn\i 



dn-h 
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(2.77) 



Thus we find the result 
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(2.78) 



In the analysis so far, we have assumed that the positions of the halos that form during the growth 
of 5 remain unchanged relative to each other during the evolution of 5. This is, however, a very 
crude approximation. To obtain a more realistic picture, we have to account for the fact that the 
region of the background density 5^ shrinks during its linear growth and thus moves the halos that 
have been created closer together as time goes by. This means that the halos which collapse at the 
time when the background field reaches the strength 5^ were farther apart at earlier times. Since the 
background fluctuation 5^ was initially a very small perturbation (5; <C (5b, the current region of the 
background density was once smaller by a factor p^/p{l + 5\) ~ Pb/p = 1 + <$tv Taking this factor into 
account we obtain a more accurate relation between the overdensity of halos and the background 



S h (M) = (1 + <5 b ) 



vo 



1 r (, V 

(5 b ~ 1 + 



vo 



(5b , 



(2.79) 



where we have neglected terms of second order in 5^. 

Since we know the correlation of the linear density field <5n n , we are finally able to compute the 
correlation of halos. Reckoning the definition of the correlation function and the fact that 5^ played 
the part of the linear theory, it follows immediately from Eq. (2.79) that 



^ hh (r, M) = b\M) 6in(r) , 6(M) = 1 + 



vo 



(2.80) 



where b is called linear bias. This result shows that DM halos are biased tracers of the underlying 
mass field with a bias depending on the mass of the halo. The higher the mass the stronger the bias. 
Note that this result is only valid for scales r within the linear regime (Seljak 2000; Cooray & Sheth 
2002). The bias for the nonlinear regime would be scale dependent. 

The relevance of this simple bias model is similar to that of the Press-Schechter mass function. It 
allows an understanding of the general behavior of the bias within our cosmological framework, but 
is less suited as a tool for a precision cosmology. A comparison of our simple model with the fitting 
formuals of Tinker et al. (2010) and Pillepich et al. (2010) are shown in Figure 2.4 for two different 
redshifts. The deviations of our simple model from the results of numerical simulations are <20%. 
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Figure 2.4. Bias of DM halos as a function of mass M for two different redshifts. The black 
lines correspond to the linear bias given by Eq. (2.80), the green lines to the fitting formula 
of Tinker et al. (2010), and the red lines to the fitting formula of Pillepich et al. (2010). The 
solid lines refer to z = and the dashed lines to z = 1. For the displayed mass, the accuracy 
of the linear bias is <20% and the relative difference between the two fitting formulas is <5%. 
For the relation v{M) a concordance cosmology was assumed and the transfer function T(k) 
was taken from Eisenstein &: Hu (1999). 

2.3.4. The halo model 

So far we have only considered the DM part of the universe. Unfortunately, it is hardly possible to 
measure DM halos directly. So it is an important question how to connect the theory that has been 
developed in this chapter to the "bright part" of the universe which can be easily observed. How do 
the galaxies fit into this framework? 

To answer this question we would need a theory on how galaxies form within the DM framework 
discussed so far. By now, many details of this process are not well understood. It is, however, 
well established that galaxies form at the centers of DM halos as baryonic matter falls into the halo 
and cools (White & Rees 1978, cf. the scenario discussed in Sect. 3.5.1). These are called "central 
galaxies". During the evolution of the LSS, some of the DM halos merge to build larger halos. If a 
big halo merges with a small one, the galaxy of the small halo becomes a "satellite galaxy" within 
the resulting halo, while the galaxy of the big halo remains the "central galaxy" . Central galaxies 
and satellite galaxies might evolve differently over cosmic time owing to their different places within 
the halo. In the course of time more and more galaxies get assembled in DM halos. A DM halo 
containing several galaxies is called a "galaxy group" or, in the case of a very huge halo containing 
hundreds of galaxies, a "galaxy cluster" . 

This is the theoretical foundation of the halo model (Peacock &: Smith 2000; Seljak 2000), which 
is an intriguing simple attempt to describe correlation function of galaxies and galaxy groups in the 
linear and nonlinear regime. It is based on a few straightforward assumptions in the light of the 
framework that we have sketched above (see e.g. Cooray & Sheth 2002 Ch. 4 and 5 for a review): 
First, all galaxies reside within halos according to a certain spherical density profile, where there is 
always a galaxy at the center of the halo. Second, the distribution of the number of galaxies in a 
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halo p(N\M) and their spatial distribution depend for a given galaxy sample only on the mass M of 
the halo. The distribution p(N\M) is called halo occupation distribution (HOD) and is the main 
ingredient to the halo model. 16 The dependence of the galaxy populations from the halo mass is the 
reason why different galaxy population cluster differently. Further common assumptions are that 
the galaxy density profile within halos follows that of the DM or that central and satellite galaxies 
constitute different galaxy populations. 

All of these assumptions are reasonable to some extent or are warranted by observations (e.g. Cooray 
& Sheth 2002), but they also show the limitations of the halo model. The assumption, for instance, 
that the spherical distribution of galaxies within halos depends only on the Mass of the halo is cer- 
tainly a mere approximation, since it is well known that even the DM profile of a halo of a certain 
mass varies from halo to halo within some range. There are also discussions in the literature about 
the dependency of the HOD on the cosmic environment in addition to the mass of the halo (e.g. 
Gil-Marin et al. 2011; Croft et al. 2011). 

In the following we develop the formalism of the halo model keeping it as simple as possible. 
Suppose we have two samples of galaxies g and g' , respectively, with no intersections between the 
samples. From the definition of the correlation function in the form of Eq. (2.73) and the assumption 
that all galaxies reside within DM halos, it follows immediately that the the cross-correlation function 
£ gg ' between these samples divides into two terms, i.e. 

e gg '(0 = ^ 1) w + e gg ? ) (r), (2.8i) 

where the one-halo term £^ contains the contribution from galaxy pairs within the same halo and 

the two-halo term £^ from pairs within different halos. We can derive approximate expressions 
for these two terms from the basic assumptions. 
For the one-halo term, we get 

^' )(r) " / ^W^^/«g(|r'|,M)t V (|r'-r| > AO^ S dM, (2-82) 

where dn^/dM is the mass function of halos, n g and n g > are the mean number densities of the galaxies, 
u g (r, M) and u g > (r, M) are for either galaxy sample the normalized mean radial galaxy density profiles 
within halos of mass M, and (N g N g /\M) is the mean number of pairs determined by the HODs of 
g and g' . It is a common practice to set u g ~ u g > ~ with Uh(r, M) the normalized mean DM 
density profile for the halos of mass M. In Eq. (2.82) we have not accounted for the assumption 
that there is always a galaxy at the center of the halo. An analytic approximation to deal with 
this complication is given in Cooray & Sheth (2002), whereby the integral over the convolution of 
the density profiles reduces to u g if (N g N g >\M) < 1. Since the dependence of on r comes only 
from the density profiles it is clear that it contributes only on scales comparable to the extension 
of the halos (< 1 Mpc). On larger scales it can be neglected. Since convolutions become simple 
multiplications in Fourier space, the one-halo term becomes much simpler if expressed by means of 
the power spectrum. This is why the formalism is usually developed in Fourier space. 
The two-halo term can be approximated by 

£ g f (0^ Vv6in(r), (2.83) 

16 Alternatively to the HOD, some authors use the conditional luminosity function (CLF) &(L\M)dL instead being the 
average number of galaxies with luminosity L residing in halos of mass M. The two approaches are equivalent to 
each other if the galaxy sample in the case of the HOD is selected by luminosity (see e.g. the comments in Skibba 
& Sheth 2009, Sect. 1, and Zehavi et al. 2011, Sect. 2.3). 
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where we have introduced the linear bias b g and b g > for the two galaxy species, respectively, as 

with (Ni\M), i = {g,g'}, the mean numbers of galaxies in halos of mass M determined by the HODs. 
In Eq. (2.83) we have neglected the extensions of the halos and formally just placed all galaxies at the 
centers of the halos which is a good approximation for scales much larger than the typical extension 
of a halo. 

As for the case of the cross-correlation function we can similarly write down the correlation function 
for the following three special cases: 

• For the auto-correlation function £ gg of the species g the one- and two-halo terms reduce to 

- / g^(M) (iVs(iV ;; 1)|M) / % (|r'|,MK(|r' -r|, M) dr> 3 dM (2.85) 

and 

^ 2) (r) - bl 6i„(r) , b g = f |g (M)6(M)^p dM . (2.86) 

Note that the modification in the one-halo term (N g (N g / — l)) is due two the fact that the 
autocorrelation function is a cross-correlation between two samples with intersecting points. 

• For the cross-correlation between the galaxy species g and halos h in the range M m ; n < M < 
M m3iX , the one- and two- halo terms become 



(hi)^_7M min dM n g fM]2!/ 
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• The DM halo auto-correlation function £hh( r ) consists for r / only of the two-halo term, i.e. 

dn^ 



, , (M)b(M) dM 

Chh(r) = d 2) (r) * ^ 6in(r) , 6 h = Lj %~ • (2-89) 

j d ^M)dM 

For the autocorrelation function of DM particles £dd we have 

($£\r) - / ^( M )y Q J u h (\r'\,M)u h (\r'-rlM)dr ,:i dM (2.90) 

df(r-) ^ 6§ 6in(r) , 6 d = | ^(M)jb(M) dM . (2.91) 



and 
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So we find the neat result that in the linear regime, any correlation function is proportional to £n n 
with a constant bias. This is, however, only true for the linear regime, where the one-halo term is 
negligible. As soon as the one-halo term becomes important, we enter the nonlinear regime and the 
bias becomes scale dependent (Seljak 2000; Cooray & Sheth 2002). This means, for instance, that 
the galaxy-galaxy correlation function £ gg is not just a scaled version of the DM correlation function 

£dd- 

Unfortunately, the halo model cannot be tested by measuring the correlation function £(r) for 
galaxies (or other objects) directly. Since the distance to galaxies is measured using their redshift z, 
we can only determine the positions of the galaxies in comvoving redshift space, but not in comoving 
real space (see Sect. 1.4.2). That is the positions of galaxies include a small random component along 
the line of sight and hence the corresponding correlation function appears distorted. To deal with this 
difficulty, we can estimate the correlation function for galaxy separations parallel and perpendicular 
to the line of sight. If s is the separation vector of galaxy pairs in redshift space, we can estimate 
£(|s|||, \s±\) for s = S|| + where sy is the component parallel and s± perpendicular to the line 
of sight. By integrating this correlation function along the line of sight, we obtain the projected 
correlation function 

/co poo 
£(|s|||,|aj_|)da|| = 2 / £(*-,r p )dir, (2.92) 
-oo JO 

where we denoted ir = \sn\ and r p = \s±\, which is independent of the redshift space distortions. 
The projected correlation function can then either be converted to the real space correlation function 
£(r) by means of an inverse Abel transform (see e.g. Peacock 1999, Sect. 16.5) or can be directly 
compared to the corresponding project correlation function from the halo model. The latter is shown 
in Figure 2.5 for the projected correlation functions from the big low-redshift galaxy surveys 2dfRGS 
and SDSS. It is evident that the halo model is very successful in reproducing the correlation function 
in both the linear and nonlinear regime for different galaxy samples. 
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Figure 2.5. Comparison of the halo model to data from 2dfGRS and SDSS in terms of the 
projected correlation function w(r p ). 

Left panels (adapted from Collister & Lahav 2005): In the upper panel the data points represent 
the observed projected correlation function (divided by r p ) for galaxies within 2dFGRS, the 
dot-dashed line is a power-law fit to the data, the solid line is the prediction from the halo 
model, and the dotted lines show the corresponding one- and two-halo terms. It can be clearly 
seen how the one- and two-halo terms approximately sum up to a power law in the range from 
0.1 h~ l to 10/j -1 Mpc. Note that the solid line is not a fit to the data, but shows the prediction 
of the halo model for the HOD that was measured by means of galaxy groups within 2dfGRS. 
Thus the curve constitutes a nice self-consistence test of the halo model. The lower panel shows 
the ratio of the data points and the solid curve to the power law fit. 

Right panels (taken from Zehavi et al. 2011, reproduced by permission of the AAS): The upper 
panel shows the observed projected correlation functions (data points) and the corresponding 
halo model fits (solid lines) for different volume-limited galaxy samples from SDSS as indicated 
within the panel. For the brightest samples (red curves) the transition from the one-halo to the 
two-halo term around r p ~ 1.5 br 1 Mpc is clearly visible. Note that the correlation functions 
are each staggered by 0.25 dex for clarity. The lower panel shows the corresponding HODs 
as a function of halo mass for the different galaxy samples. It is shown for the brightest 
sample how the HOD combines from the HOD of central (dashed line) and of satellite (dotted 
line) galaxies. 
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Chapter 




General relativistic treatment of linear 
structure formation 

In the previous chapter we presented the theory of structure formation in the context of Newtonian 
physics. However, this theory is only valid well inside the horizon and only in a spatially flat universe. 
If we want to understand how initial perturbations were created during inflation and how they evolved 
to the present time, we also have to study perturbations outside the horizon which is only possible 
by a pur ley general relativistic treatement. 

In this chapter we give a brief introduction to the full general relativistic treatment of linear struc- 
ture formation. Our goal is to complement and confirm the results from our study of perturbations 
in the Newtonian regime and to understand the ingredients of modern cosmological codes like CMB- 
fast which produce the most accurate transfer functions T(k) for cosmology. We will see how the 
Newtonian treatment in Section 2.1 arises naturally as a limiting case for perturbations well inside 
the horizon, and in the next chapter we will apply the general relativistic framework developed in 
this chapter to derive the power spectrum of perturbations that is created by the simplest scenario 
of inflation. 

The term "linear theory" formally means that we perform our calculations with a set of per- 
turbation quantities which are very small (of the order of 1CT 5 in the early universe such as the 
relative amplitude of the CMB temperature fluctuations) and always keep only terms that are linear 
in perturbation quantities. Due to the smallness of the perturbations, this approach is very accurate 
in the early universe, and an immediate consequence of this procedure is that the resulting field 
equations and equations of motion are linear differential equations. This simplifies the analysis a lot 
and enables us to solve these equations independently of the (stochastic) initial perturbations that 
were created during inflation. The corresponding theory was first developed by Lifshitz (1946) in a 
remarkable paper treating the problem of relativistic structure formation with impressive generality. 1 
In this chapter we will mainly follow Seljak (unpublished lecture notes), Durrer (2008), and Weinberg 
(2008). 



1 Bertschinger (1995) commented on Lifshitz's paper: 

This classic paper was remarkably complete, including a full treatment of the scalar, vector, and tensor 
decomposition in open and closed universes and a concise solution to the gauge mode problem; it presented 
solutions for perfect fluids in matter- and radiation-dominated universes; and it contrasted isentropic 
(adiabatic) and entropy fluctuations. 
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We will entirely stick to the case of a spatially flat universe (K = 0). This condition further 
simplifies our calculations enormously, since it allows us to decompose the perturbations into the 
familiar Fourier modes. This restriction is justified by the measured 95% confidence limit on the 
present day curvature being —0.0133 < fix < 0.0084 (see Table 1.1) and by the fact that according 
to the discussion in Section 1.6.1 the early universe was even much natter than the present day 
universe. For a treatment of linear perturbation theory in non-flat universes we refer, for instance, 
to Kodama & Sasaki (1984), Hu et al. (1998), and Durrer (2008, Ch. 2 and App. 9). 

Finally, we want to briefly introduce some conventions on the notation we are going to adopt. 
Greek indices p,, v, etc. generally run over the four spacetime coordinates, while latin indices i, j, 
etc. run only over the three spatial coordinates. Repeated indices are automatically summed over. A 
quantity with a bar (e.g. p) denotes its unperturbed value of the background FLRW universe. We will 
adopt the 1+3 formalism, i.e. x = (r, x) with r the conformal time and units such that c = 1. Bold 
symbols refer to spatial, 3-dimensional quantities. A dot (e.g. p) denotes the derivation with respect 
to conformal time r and spatial derivatives are abbreviated by di = d/dx l . Spatial hypersurfaces of 
constant conformal time r are called "slices" . 

3.1. Perturbations 
3.1.1. Perturbed metric 

Treating the universe as perfectly homogeneous and isotropic leads automatically to the FLRW 
framework described in Chapter 1. Using conformal time r (see Eq. (1.8)) instead of cosmic time t 
the Robertson- Walker metric takes the form of Eq. (1.9) and reduces for K = to the very simple 
expression 

ds 2 = g^dx^dx" = a 2 (r) (-dr 2 + cbfda?) , (3.1) 

where jij = 5ij for cartesian comoving coordinates x. Here, a(r) is dimensionless, while r and x 
take units of length. In these coordinates, the energy momentum tensor T^ v taking the form of an 
ideal fluid reads as 

= {p + p) u»u u + p r v = ^diag (p, p, p, p) , (3.2) 

where p(r), p(r), and u M (r) = a _1 (l, 0,0,0) are the mean energy density, mean pressure, and mean 
4-velocity, respectively, of the fluid. Instead of the Hubble parameter H we will always use the 
conformal Hubble parameter % = a/ a = Ha. 

The homogeneous and isotropic FLRW universe does not contain any structures and can thus be 
regarded as a treatment of structure formation at "zeroth order". The "first order" (or "linear") 
treatment is based on the assumption that, at early times, the metric g^ u can be approximated by a 
perturbed Robertson- Walker metric, i.e. 

9nv = 9^u + 8 gpv , (3.3) 

where 5g^ v is a small perturbation that will be treated only to first order. We can write the pertur- 
bation 5g^ u in general as 

5g^dx^dx u = a 2 (r) [-2A dr 2 + B< drdx i + 2 (H L Sij + Hy) dx l dx j ] , (3.4) 

where the perturbations A(t,x), Hl(t,x), Bi(r,x), and Hij(T,x) are functions of x = (t,x) and 
Hij is symmetric and traceless. (Note that 2>Hl basically plays the role of the trace of Hij and was 
taken out from for later convenience.) 
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The transformation behavior of 8g^ u (and similarly for any other perturbed quantity) for a coor- 
dinate transformation x — >■ x' is with 



given by 



dx a dx 13 
9 ^ X) ~ dx~^ 9a ^ X) 



+ Sg^x) . 



(3.6) 



Since the background metric g^v is invariant under spatial translations and rotations on a slice 
of constant conformal time r, the metric perturbation 5g^ v behaves like a 4-tensor under these 
transformations. Thus, it is easy to see that, under spatial translations and rotations, A and Hl 
transform like 3-scalars, B{ like a 3-vector, and Hij like a 3-tensor. From now on, whenever we talk 
about geometrical quantities on a slice of constant r, we refer to their transformation behavior under 
spatial translations and rotations. 

The full perturbed metric is with Eq. (3.4) given by 



(3.7) 




3.1.2. Perturbed energy-momentum tensor 

Similarly to the metric tensor, we decompose the perturbed energy-momentum tensor T^ v into its 
FLRW part and a small perturbation 



/IV 



(3.8) 



In a perturbed universe, the form of T^ v is in general not restricted to the form of an ideal fluid 
anymore. However, we will assume that it at least takes the form of a real fluid 



(p + P) u^u u + pg fiu + IV , 



(3.9) 



where p(r, x) = p(r) + (5p(r, x) is the energy density, p(r, x) = P(t)+5p(t, x) the pressure, u^(t, x) = 
U/i{t) + 5u^{t,x) the 4-velocity, and II Mi ,(T, x) the anisotropic stress of the fluid. The quantities 
8p, 8p, Sun are treated as small perturbations. Since the anisotropic stress U^ u has no counterpart 
in the ideal fluid of the FLRW background, it is treated as a perturbation as well. Furthermore, the 
anisotropic stress obeys the restrictions 



n, 



n™«" = o , 



if, 



o 



(3.10) 



i.e. it is a traceless, symmetric 4-tensor perpendicular to the 4-velocity u^. The normalization of 
is given by 

g^uW = -1 , (3.11) 
so the general form of 5u^ at first order reads as 2 

1 



SvP = - (-Ad) , 



5u, 



9^u u - g^v? = a(-A, B + v), 



(3.12) 



2 We have to be careful in raising/lowering indices of perturbation quantities. The full perturbed quantity has to be 
always considered for the multiplication with the metric. The perturbation quantity with the lowered/raised index 
is then obtained by subtracting the corresponding unperturbed quantity with lowered/raised index (cf. Eq. (3.6)). 
See also the discussion of gauge transformations in Section 3.4. 
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where v is the (peculiar) velocity of the fluid. With this form of Su^ the condition U^u^ yield at 
first order n 00 = n i0 = 0, and thus U\ = 11% = 0. So II can be restricted to its spatial part Iljj. 
Taking everything together, the energy momentum tensor T^ v becomes at first order 



T m = a 2 [p(l + 2A)+6p] 

T i0 = T i = -a 2 [p Bi + (p + p) Vi] 

Tij = a 2 p [(1 + 2H L ) Sij + 2H i:j } + a 2 5p + a 2 n„ . 



(3.13) 



Analog to the perturbation part of the metric 5g^ v , the perturbation part 5T^ U of the energy mo- 
mentum tensor is a 4-tensor under spatial translations and rotations, since T^ u is invariant under 
these transformations. So Sp and Sp are 3-scalars, Vi a 3-vector, and H^- a 3-tensor. 

If the fluid T^ u is made up of several non- interacting fluids [Tj]^, I — 1, . . . , N such that 



N 



flU 



(3.14) 



i=i 



we immediately see using Eq. (3.13) that the perturbation quantities add as follows: 

N N N N 



u 



1=1 



1=1 



1=1 



1=1 



(3.15) 



3.2. Scalar-Vector-Tensor (SVT) decomposition 

Our goal is to solve the field equations and equations of motion at first order. However, with 
the general first order metric and the general first order energy momentum tensor as given by the 
Eqs. (3.7) and (3.13), respectively, this would lead to horribly complicated equations. 3 Fortunately, 
the rotational symmetry of the underlying homogeneous FLRW universe allows us to decompose 
the perturbations into 3-scalars, divergenceless 3-vectors, and divergenceless, traceless, symmetric 
3-tensors. This will simplify our analysis considerably, since these different contributions are not 
coupled to each other by the field equations or equations of motion. We will first describe this 
decomposition in real space and then move into Fourier space for a more detailed analysis. Finally, 
we give a proof of the decomposition theorem. 



3.2.1. SVT decomposition in real space 

For the decomposition, we can fully stay in comoving space. That is we scale out the expansion 
of the universe such that the 3- metric becomes 7^' = cT 2 gij with 7^ = 8^. So the metric of 
the background becomes trivial. 4 In the following we consider the 3-scalar S(t,x), the 3-vector 
Vi(r,x), and the traceless, symmetric 3-tensor Dij(r,x) being arbitrary perturbation quantities in 
our comoving frame. 

3 See e.g. Weinberg (2008, p. 219-224) who derives the field equations and equations of motion in real space by "brute 

force" . He then calls the obtained system of equations "repulsively complicated" . 
4 Since 3-vector and 3-tensor perturbations do not have a corresponding unperturbed quantity in the background 

FLRW universe, raising and lowering indices is performed by applying 7;j to the perturbation quantity, e.g. 

V 1 = 7 lJ {Vj + V 3 ) - rVj = 5 ij V j = V t , (3.16) 
since Vj = 0. Thus raising and lowering indices becomes trivial. 
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The analysis of the 3-scalar S is trivial, since S cannot be decomposed any further and so has 
just a scalar part, i.e. S = S^ s \ However, it is always possible to decompose the 3- vector Vi into a 
3-scalar part ("gradient") and a divergenceless 3-vector part ("curl"), i.e. 5 

Vi = t V^ + V} V) , (3.17) 

where 

diV* = . (3.18) 
Similarly, a spatial traceless, symmetric 3-tensor field can always be decomposed as 

D ij = fa - ^%a) 4 5) + \ (djrfP + d t D { p) + D<P , (3.19) 

where 

d l D { p = D { p l = djDp 3 = . (3.20) 

(S) (V) (T) 

Here Dj, transforms like a 3-scalar, D T like a 3-vector, and D^- like a 3-tensor. This is the 
scalar-vector-tensor (SVT) decomposition in real space. 

To see how this decomposition arises, we briefly describe how it can be constructed. To obtain 
Eq. (3.17), we define as the solution of 



and then simply by 



AV {S) = diV 1 (3.21) 



Vf v) =Vi- diV {s) . (3.22) 



Similarly, to obtain Eq. (3.19), we can define as the solution of 



d i d j (didj - SijA) D { p = &&Dij , (3.23) 



D, V ^ as the solution of 



ADP = cPDji - di 



8 a d b (d a d b - 6 ab A) D { p (3.24) 



(T) 

and we finally set D\- to 



D<P = - (d^j - 4 5) - \ (d 3 D { P + diDp) . (3.25) 

According to this construction it is obvious that the conditions in the Eqs. (3.18) and (3.20) are 
satisfied automatically. 



5 Since the spatial metric jij — jij + 8^/ij is slightly curved, we have to apply covariant derivatives Vi, but with S, Vi 
and Dij being perturbations the covariant derivative Vi reduces to the usual derivative di at first order. 



3. General relativistic treatment of linear structure formation 



60 



3.2.2. SVT decomposition in Fourier space 

The meaning of the SVT decomposition is much easier grasped in Fourier space. Since the background 
FLRW universe is spatially flat and it holds \Sjij/^ij\ <C 1, we can decompose each perturbation 
quantity 5Q(t,x) in Fourier modes e lkx on each slice of constant r, i.e. 6 



5Q(r,k) = / 8Q(r,x)e 



-ikx 



dx 6 



5Q(t, x) 



(2^ 



5Q{t, k)e ikx dk 



(3.28) 



Now consider an arbitrary Fourier mode k and choose two normalized vectors e± and e 2 perpendicular 
to k, so that the set {ei,e2,fc} constitutes an orthonormal basis for our (unperturbed) comoving 
spaces. 7 Then Fourier transforming the real space SVT decomposition of the 3- Vector Vi (see the 
Eqs. (3.17) and (3.18)) we immediately see that the scalar part must be parallel to k in Fourier 
space, while the vector part must be perpendicular to it. Similarly, Fourier transformation of the 
SVT decomposition of the 3-tensor D, L j (see the Eqs. (3.19) and (3.20)) shows that the scalar part 
has two components along k, the vector part has one component along and one perpendicular to 
k, and the tensor part has two components perpendicular to k. With this information we are able 
to construct a basis for 3-scalars, 3-vectors and traceless, symmetric 3-tensors in Fourier space such 
that each basis element is associated to either a scalar, vector, or tensor part. Expressed by means 
of the helicity basis 



e ± = 



1 



[ei ± ie 2 ] 



the three sets of basis elements are: 8 



(3.29) 



3-scalar: 



S<® = 1 . 



(3.30) 



• 3-vector: 



(0) _ 



-ih , V, 



(3.31) 



traceless, symmetric 3-tensor: 
T>\^ = —kikj + -dij , 



(±i) _ 

2 



kj 6.- -\- ki 6.- 



°To be precise, on a manifold with metric 7^ the integral reads as 



8Q(t, ft) = J 8Q(j,x)e %hx \J~f(j, x) dx :i 



V {±2) = e ± e ± 



where 7 = | det(7ij)|. However, with \5~fij/~/ij\ <Si 1 the determinant is at first order 

det(7 i;) ) = det(7ij + 5^ zj ) = det(7y) + tr(^7 ■ 7 _1 ) = 1 + tr(57 ■ 7 _1 ) . 



(3.32) 



(3.26) 



(3.27) 



Thus when integrating over a perturbation quantity, we can neglect the term of order tr(<$7 • 7 _1 ) in Eq. (3.27), 
since we restricted ourselves to a first order calculation. 
7 Note that our space is flat and so equal to the tangential space at any point. This is why we can define the basis 
{ei, e 2 , fc} globally without explicitely specifying a tetrad. Also note that the vectors and tensors in our tangential 
spaces are complex, because we are in Fourier space. So any vector can be represented by the basis {ei, e2, k} with 
complex coefficients. 

8 These basis elements are a representations of the so-called harmonic functions Y, Yi, and Yij for K = (up to the 
factor e lfca! which is omitted in our basis). In general, the harmonic functions are also defined for non-flat FLRW 
universes and are eigenfunctions of the generalized Laplace operator Vj-Vj^ 1 -', where 7ij is the spatial metric of the 
background FLRW universe and V 1 the covariant derivative. For each K the set of these solutions constitutes a 
complete set for decomposing 3-scalars, 3-vectors, and 3-tensors, respectively. For a summary of the properties of 
the harmonic functions see the Appendix C of Kodama & Sasaki (1984). 
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It is easy to see that basis elements with m = are associated to the scalar parts, those with m = ±1 
to the vector parts, and those with m = ±2 to the tensor part. This is the SVT-decomposition 
in Fourier space. The meaning of the index m will become clear, when we study the behavior of 
these basis elements under rotations. The sets of bases contain 1 element for 3-scalars, 3 elements 
for 3- vectors, and 5 elements for 3-tensors according to the degrees of freedom of 3-scalars, 3-vectors, 
and traceless, symmetric 3-tensors, respectively. Thus to proof that these sets indeed are bases, we 
just have to show that their elements are linear independent. This is easily done by introducing the 
inner product (• , •) defined by the hermetian contraction of vectors or tensors as follows 



V^vf'} = VW* Vf r = <W , m,m' = 0,±1 (3.33) 
vlfMf) = 2> (m) M mr = <W, m,m' = 0,±l,±2. (3.34) 



Thus the 3-vectors V^ m \ for m = 0, ±1, and the 3-tensors T^^, for m = 0, ±1, ±2, are all orthogonal 
to each other and hence automatically linearly independent. 

The meaning of the tacitly introduced index m describes the transformation property of the basis 
elements under spatial rotations. If we rotate e\ and e2 counterclockwise around k by an angle ip, 
we find immediately 

e± = e ± . (3.35) 
So the basis elements transform according to their definition 

£(m) = e -irrupg(rn) ^ y(m) = e ~imipy(m) ^ jj{m) _ e -im<pjy(rn) _ (3.36) 

The quantities 5 (m) , V (m) , and P (m) each constitute a new basis according to the choice of new unit 
vectors e± and e.2- Since we can expand any 3-scalar S, 3- vector V, and 3-tensor D in these sets of 
basis elements, i.e. 

S = S^SW = S^SW , 

1 1 

v = ^2 v^v^ = ^2 v^v^ , 

m=-l m=-l (3.37) 

2 2 

m=— 2 m=—2 

the corresponding coefficients must transform the opposite way, i.e. 

g(m) _ e imifig(m) ^ y(m) _ e imipy(m) ^ jj{m) _ ^rrup £j(m) _ (3.38) 

This means, under rotations around k, the coefficients transform like helicity states of helicity (or 
spin) m. Helicity states with m = are scalar perturbations, helicity states with m = ±1 vector 
perturbations, and helicity states with m = ±2 tensor perturbations. Scalar perturbations cor- 
respond to the usual energy overdensities of Newtonian physics and vector perturbations correspond 
to velocity perturbations in Newtonian physics. Tensor perturbations are also called gravitational 
waves and have no Newtonian analogon. We are mostly interested in scalar perturbations, since these 
are the perturbations that can undergo gravitational instability and can lead to structure formation 
in the universe. 
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By means of the Eqs. (3.37), S, V, and D are now entirely expressed in terms of helicity states. 
In the coordinate system {e±, e2, k} they have the explicit representation 



S = S< > 



Vi 



s/2 



Di 



3 2 

■ D(+2)_£)(-2) 



'71 



p(») 

3 



V (+D _ y(-Dj j _i V (0) 



\ 



—i 



D (+i) +D (-i) 
2V2 



p(+2)_p(-2) 
2 

_ D(+2) +jD (-2) 
2 

p(+l)_p(-l) 

2^ 



£>(+l) _£>(-!) 



2^2 
2D(°) 



/ 



(3.39) 



We can summarize the results of the last two sections as follows: In Fourier space, 3-scalars are 
functions of helicity m = 0, 3-vectors are superpositions of functions of helicity m = 0, ±1, and 
traceless, symmetric 3-tensors are superpositions of functions of helicity m = 0, ±1, ±2, while in real 
space, the helicity m = states correspond to 3-scalars, the helicity m = ±1 states to divergenceless 
3-vectors, and the helicity m = ±2 states to transverse, traceless, symmetric 3-tensors. Regarding the 
metric perturbation 5g^ v (see Eq. (3.7)), it follows that it can be decomposed into the helicity states 



£>[ 0) , B^, m = 0,±1, and H^ l > , m = 0,±1,±2. These are exactly 10 degrees of freedom as 
expected from the 10 independent components of the general perturbed metric. This confirms the 
generality of our analysis so far. 



r (m) 



3.2.3. Independence of different Fourier modes 

Before proving the decomposition theorem, we first have to show that at linear order the field 
equations and equations of motion for different k modes decouple. This follows from the translational 
invariance of the background FLRW universe. 

Suppose the field equations and equations of motion contain N perturbation quantities 5a, A = 
1, . . . , N. At linear order, the evolution of any perturbation 5a can generally be expressed as 

N 

5 a (t, k) = Y J t ab(t, n; k, k') 5 B (n, k') dk' 3 , (3.40) 

B=l J 

where Tab{t, t\; k, k') is the transfer function for the perturbation 5a and t\ < r is an arbitrary initial 
time. Note that Tab{ t i r ii k, k') can only depend on the background FLRW universe, since if it was 
dependent on a perturbation quantity, the term Tab(t, t\; k, k')5B(Ti, k') would be of second order 
and thus would be neglected in our linear treatment. Now we perform a coordinate transformation 
x = x + A.x with Ax a constant translation. Since the background FLRW universe is translational 
invariant, both the metric perturbation 5g iiV and the energy momentum perturbation 5T flv transform 
like 4-tensors under spatial translations. But transforming like a 4-tensor under spatial translations 
just means Sg^^T^x) = 5g llv (T,x), since dxi/dxj = 5ij. Thus in real space we have 5a(t, x) = 
5a(t, x) for any perturbation quantity. The transformation behavior in Fourier space is then 

5 A (r,k) = f h{T,x)e- ikA dx 3 = f 5 A (T,x)e-^ kx+Ax Ux 3 = e- ikAx 5 A (T,k) . (3.41) 
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Thus transforming Eq. (3.40) yields 

N 

~S A (r, k) = e- ikAx 6 A (r, k) = e~ lk * x £ / T ab (t, n; k, k') S b (t u k') dk' 3 

B=l J 

N 

= J2 e- ikAx T AB (T,T i; k,k')e ik ' Ax 6 B (T U k')dk' 3 (3.42) 

B=l ^ 

N 

B=l^ 

As the background FLRW universe is translational invariant, the transfer function must be transla- 
tional invariant as well, i.e. T ab (t,ti; k, k') = T ab (t,ti; k, k'), and so we obtain from the last two 
equalities in Eq. (3.42) 

e^ k '- k ^ x T AB (T,n;k,k') = T AB (r,r i ;k,k') (3.43) 

for any Aa:. This means that for k ^ k' the transfer function must vanish. Hence different Fourier 
modes are not coupled to each other and, for the further analysis, we can focus on a single arbitrary 
Fourier mode k. 



3.2.4. Decomposition Theorem 

Now we are able to proof the decomposition theorem that will simplify the subsequent analysis 
of the field equations and equations of motion immensely. The theorem states that due to the 
rotational symmetry of the FLRW background universe, perturbations of different helicity m evolve 
independently from each other to first order. The proof is very similar to the one given in the last 
section. 9 

Again, suppose the field equations and equations of motion contain a set of iV perturbation quanti- 
ties 5 A , A = 1, . . . , N, where m A is the helicity of the perturbation S A . Since different Fourier modes 
are decoupled, we can now express the evolution of a given perturbation 5 A as 

N 

S A (r,k) = J2 T Mr,n;k)S B (r h k) , (3.44) 

B=l 

where T ab (t, t\; k) is the transfer function associated to the perturbation 5 A for a given Fourier mode 
k and t\ is again some arbitrary initial time. If we perform a spatial rotation around k by some angle 
(p, Eq. (3.44) becomes 

N 

~5 A (r, k) = e im ^ 5 A (r, k) = e im ^ £ T AB (r, r ; ; k) 5 B (r u k) 

B=l 

N 

= £ T AB (r, ti; k) S B (n, k) (3.45) 

B=l 

N 

= ^2 T AB (T,Ti;k) 5 B (n,k) . 

B=l 



'A proof of the decomposition theorem for a general FLRW background universe is, for instance, given in Kodama & 
Sasaki (1984, App. B) or Straumann (2008). 
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Due to the rotational symmetry of the background FLRW universe, the transfer function must be 
rotationally invariant as well, i.e. Tab(t, T\;k) = Tab{t,t\; k) depending only on k. So we end up 
with 

e i ^- mB ^T AB (T,T 1 ;k) = T AB (r,T 1 ;k) (3.46) 

for any angle if. This means that for every index pair (A,B) such that tua / rriB, the transfer 
function must vanish. Thus different helicity states are indeed decoupled from each other. 



3.3. Field equations 

With the preliminaries of the previous section we are now able to compute the field equations and 
equations of motion in a sensible way. We compute them in Fourier space, where the independence 
of different Fourier modes allows us to focus on an arbitrary mode k, and split them in equations of 
different helicity according to the decomposition theorem (see Sect. 3.2.4). 
The field equations are 

= 8ttG , (3.47) 

where G^ v is the perturbed first order Einstein tensor. Since the unperturbed quantities satisfy the 
field equations, i.e. G^ v = 8-kGT^, we have to compute 

SG^ = 8nG 5T^ u , (3.48) 

whereas 5G^ U = G^ v — G^ v . To compute the field equations in Fourier space, we choose the basis 
{e±,e2,k} and represent the 3-scalars, 3-vectors, and 3-tensors of the metric perturbation 5g^ v and 
the energy momentum perturbation 5T^ V in terms of helicity states so that the particular expressions 
are given by Eq. (3.39). The computation of 5G^ U is straightforward, but lengthy and tedious. We 
will not go into the details of this calculation, but refer to the Appendix D of Kodama & Sasaki (1984), 
who provide explicit expressions for the perturbations of many geometrical quantities (e.g. Christoffel 
symbols ST" , scalar curvature 51Z, Einstein tensor dG^y) in terms of helicity states using the same 
basis as Eqs. (3.30)-(3.32). The field equations then become 10 



For the scalar perturbations, the density equation corresponds to the SG ' j—5G o component, the momentum equation 
to 5G°j, the pressure equation to 8G l i — <5G () , and the anisotropic stress equation to 5G X i — 1/3 5GV For the vector 
perturbations, the momentum equation corresponds to the 5G°j component and the anisotropic stress equation to 
SG'j. The tensor perturbations correspond to the 8G % j component. 
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Scalar field equations: 



density: 



^(0) + 1#(0) j _ n ^ B (0) + ^(0) j = 47rGa 2 ^(0) + 3 ^ (p + p) («(°) + B 
momentum: 

^(o) _ #(o) _ 1 _h = ^Ga 2 \ (p + p) (t,(°> + B<®) 
pressure: 

+ -Hd T - h 2 ^j - (3 T + W) (tff - = 4^Ga 2 (V 0) + hp^ 

anisotropic stress: 

k 2 (V) + + ^(°)) - (d T + 2%) (jfeBW + if (°)) = -8vrGa 2 n(°) 



Vector field equations: 



momentum: fc5 (±1) + ij (±1) = -16vrGa 2 i (p + p) (v {±1) + B^ 1 ) 
anisotropic stress: (<9 T + 2%) (*lB {±1) + F (±1) ) = 8vrGa 2 n (±1) 



Tensor field equation: 

anisotropic stress: 



(3.49) 



(3.50) 



(3.51) 



The field equation are complemented by the equations of motion given by the general relativistic 
energy-momentum conservation 



V U T^ = d u T^ + Pt v T^ v + VpJTtf = 



(3.52) 



Again using Appendix D of Kodama k, Sasaki (1984) (and additionally Appendix A for the unper- 
turbed expressions) we find 



Scalar equations of motion: 



continuity: (d T + 3M) 5p {0) + m5p (Q) = - {p + p) (W 0) + 3ij[ 0) ) 
Euler: ^ (d T + m) [(p + p) (w (0) + B^)] = 5p {0) - + (p + p) A<® 



(3.53) 
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Vector equation of motion: 



Euler: - (d T + AH) 
k 



(3.54) 



Note that the equations of motion are not independent from the field equations. However, since 
the field equations are second order differential equations and the equations of motion only first order 
differential equations, it might be quite useful to use the latter in place of two of the field equations. 
Furthermore, if the universe is made up of N non-interacting fluids [Tj]^ u , I = 1, . . . ,N, the energy- 
momentum conservation is satisfied separately by each fluid, i.e. V^fT/]^ = for I = 1, . . . , N. This 
information could not be derived from the field equations. 

Also note that the scalar and tensor metric perturbations do not have their own degrees of freedom. 
If the energy momentum tensor is fully specified, these metric perturbations are specified as well 
including their initial conditions. So to solve the scalar and vector equations, only initial conditions 
for the energy momentum tensor and its derivative are required. However, the situation is different 
for tensor perturbations. It is obvious that even if n( ±2 ) = Eq. (3.51) allows a nonzero solution for 
H^ 2 ^ and to specify this solution initial conditions for H^ 2 ^ and H^ 2 ^ are required. 

The field equations and equations of motion became much simpler by decoupling them into different 
helicity states, but they are still very complicated. Fortunately, there is still a way to simplify them 
without committing any further approximations within our linear treatment. This is choosing the 
coordinates x in a sophisticated way, which will be discussed in the next section. 



3.4. Gauge transformations 

So far we have described the perturbations in a given coordinate system x = (t,x), but we have 
not said much about the coordinate system itself. The only thing we know about this coordinate 
system is that it was chosen such that the metric g^ v approximates very closely the FLRW metric 
g^v allowing us to treat the difference of the two 5g^ u as a small perturbation. However, while in the 
limiting case of a homogeneous and isotropic universe there is a preferred coordinate system with 
the metric becoming particularly simple, i.e. it takes the form g^ v , there is no such preferred choice 
in the presence of perturbations. Therefore it might be useful to change the coordinate system by 
a small coordinate transformation, so that bg^ v in the new coordinate system is still very small. 
Such coordinate transformations are called gauge transformations. 11 While we have performed 
all calculations keeping the full generality of the metric g„ v , we are, in principle, free to choose a 
particular coordinate system (or a particular gauge) such that it takes a suitable form. 

From a fundamental point of view, no gauge is better than the other, but there might be certain 
gauges which are particularly appropriate for certain applications. More important, in the general 
gauge used so far, not all of the perturbations A, Hi, Bi, and Hij correspond to "physical pertur- 
bations" in the universe. Some of them are spurious and exist only in a particular gauge choice. 
This "unphysical gauge modes" impede the interpretation of the perturbations and are responsible 
for some confusion in the past about the meaning of some perturbations (see Kodama & Sasaki 1984 
Sect. 1 for a historical review). Therefore, it is important to eliminate these unphysical gauge modes. 
In the literature there are two different schools how this can be achieved. One way is by introducing 
"gauge independent perturbations" (e.g. Bardeen 1980; Kodama Sz Sasaki 1984; Mukhanov et al. 



11 Malik & Matravers (2008) provide a review about different perspectives on the mathematical interpretation of gauge 
transformations. 
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1992; Mukhanov 2005; Straumann 2006; Durrer 2008), the other way is to "fix the gauge" by setting 
constraints on A, Hl, B{, and Hij such that the coordinate system is completely specified (e.g. Ma & 
Bertschinger 1995; Seljak & Zaldarriaga 1996; Hu et al. 1998; Liddle & Lyth 2000; Weinberg 2008). 
None of these two approaches is superior to the other. The most important thing is to know how to 
correctly interpret the perturbation quantities. This is probably equally difficult in both approaches. 
Here, we will follow the second school and treat the gauge issue by fixing the gauge and changing 
between different gauges. 



3.4.1. Transformation rules 

A general gauge transformation has the form 



(3.55) 



where £ M (x) is treated as a small perturbation. In the following we will denote all transformed 
quantities by a tilde (e.g. q). How do perturbations transform under gauge transformations? As we 
already saw in Section 3.1.2 when lowering indices of perturbation quantities, we have to be very 
careful in treating perturbations as geometrical objects. The reason is that splitting variables into 
background part and perturbation part is a non-covariant procedure. Thus in general, perturbation 
quantities do not transform like usual geometrical objects. To derive their transformation behavior, 
we need the following first order relation 



Let q(x) be a 4-scalar, i.e. q(x) = q(x). It holds 

q{x) = q(x) = q(x - £) = q(x) - d^x) = q{x) - 8^q{x) ?{x) 



(3.56) 



(3.57) 



Thus with the definitions of the perturbations Sq = q — q and 5q = q — q, and with q(x) = q(x), we 
obtain the transformation behavior for perturbations of 4-scalars: 

5q(x) = Sq(x) - d^q(x) . (3.58) 

For a 4-vector q^, i.e. q^(x) = dx a /dx )1 q a (x), it works quite similar. With dx a /dx^ = 5° — d a ^(x) 
we have 

c)x^ r i r & 

= q^(x) - dpq^x) f(x) - q a (x) d^ a {x) . 

Thus again, with the definitions Sq^ = q^ — q^ and Sq^ = q^ — q^, and with q^(x) = dx^/dx 01 q a {x) 
we find 

Sq»(x) = 5q^(x) - dpq^x) f (x) - q a (x) d^ix) . (3.60) 
Analog we have for 4-tensors q^ v 

dx a dx@ r i r i r i 

%v{x) = -Q^-Q^p Qafi(x) = \S° - S^ a (s)J [5? - d u f(x)\ \ q a p(x) - djq a p(x) £ 7 (x)J , (3.61) 

and obtain by the same argument 

Sq^ v (x) = 5q^ v {x) - dpq^x) (?(x) - q^{x) d u f{x) - q av {x) d^ a (x) . (3.62) 
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So we can summarize the transformation behavior for scalar, vector, and tensor perturbations as 



8q(x) 


= Sq{x) - 


d^q(x) e(x) 






= Sq^x) - 




q a (x) d^ a (x) 




= hpLu{x) 


- dpq^x) f(x) 


- q^(x) d v (?{x) - q av {x) d^ a (x) . 



(3.63) 
(3.64) 
(3.65) 



Note that the argument is always the same on both sides and is thus not referring to the same point 
in spacetime unlike the usual transformation, e.g. q(x) = q(x), where x and x are different arguments 
but refer to the same point in spacetime. 

Knowing how Sg^ u (x) transforms we are able to derive the transformation behavior for the single 
helicity states that are contained in 5g^ u . To do this, we decompose £ M itself in helicity states (see 
Eq. (3.39)) 



£"(*) = (T,L) 



T(0) LC+i) + .L(+D - Li- 1 ) 



V2 



V2 



(3.66) 



Since there are no tensor modes involved, it follows immediately that tensor perturbations are in- 
variant under gauge transformations. The transformation behavior for the scalar and vector modes 
are obtained by simply comparing 5g^ u (x) to 5g^ u (x). Using Eq. (3.65) together with the relation 



O y g^C = dog^f + dig^e = 2Hg^T^ 
we obtain after a straightforward calculation 



(3.67) 



A(°)(ar) 


= A (0 \x)-f( \x)-nT^(x) 


m{ X ) 


= B^(x)-L^(x)-kT^{x) 


Hf{x) 


= Hf{x)- k -L^{x)-UT^{ X ) 


H<®(x) 


= H^\x) + kL i -°\x) 


B^\x) 


= BW(x) - L (±1) (x) 


E^\x) 


= E^ 1 \x) + kL ( - ±1 \x) 


E( ±2 \x) 


= E^ ±2 \x) 



(3.68) 



and similar for the helicity states of the energy-momentum perturbation 5T^ U 



5pt°\x) = 5p(°\x) - p(x)T (0 \x) 
Sp(°\x) = 5p i0) (x) -p(x)1< \x) 

(to) 

Tl {rn Xx) = rt m) {. 



r "°'(x) + L v " ( r 



[x 



[m> i^ m = 0,±l 
m = 0,±1,±2 . 



(3.69) 



Note that in the Eqs. (3.68) and (3.69) there is again always the same argument x on both sides. 
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3.4.2. Particular gauges 

Since tensor perturbations are gauge invariant, there is no gauge choice for them. On the other 
hand, since vector modes are very unlikely to be generated in the early universe (see the footnote 
in Sect. 4.1.2), we will not consider them any more. We will only consider scalar perturbations and 
thus drop the (0) superscripts for ease of notation. Although there are many scalar gauges discussed 
in the literature, we will only introduce two of them, because we will need them in the course of 
our analysis. The Newtonian gauge is particularly useful for analytical treatment inside the horizon 
and allows a very simple interpretation in terms of Newtonian physics. The comoving gauge will 
be used for the study of perturbations outside the horizon in the context of the generation of initial 
perturbations (see Ch. 4). 



Newtonian Gauge 

The Newtonian gauge is defined by the condition 

B = H = 0, 

and we will rename the non-vanishing perturbations by 

* = A , $ = -H L 



(3.70) 



(3.71) 



Using Eqs. (3.68) and setting B = H = we see that the Newtonian gauge is obtained from a general 
gauge by the transformation 

(3.72) 



nn B H 

T = 1 

k k 2 



L 



k 



Since the transformation is uniquely determined, the Newtonian gauge is entirely fixed. 
The field equations (3.49) become 12 



-fc 2 $ - 3H ( <& + W$> ) = 4Tra 2 Sp 



<j> + H$> = AirGa 2 - (p + p) 



$ + H (4f + 2$) + (m + + y ($ - *) = 47rGa 2 5p 



k 2 ($-$) = 8vrGa 2 n 



(3.73) 
(3.74) 

(3.75) 
(3.76) 



and the equations of motion (3.53) 



(d T + m) 5 P + msp = 


-(P + P) 


[kv - 36) 


(d T + AH) 








1- (p + p) * . 






5p- 2 U- 
3 



(3.77) 



These equations are still exact aside from the use of first-order perturbation theory and the assump- 
tion that the energy-momentum tensor is given by a fluid. Since 1/% is the comoving Hubble horizon 



12 The second and the fourth equations correspond to the momentum and anisotropic stress equation, respectively. The 
first equation is the density equation minus 37-L times the momentum equation. The third equation is the pressure 



equation minus 1/3 times the first equation. 
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(see Sect. 1.4.3), %/k <C 1 means that the length scale of a given Fourier mode k is well within the 
horizon. In this limit Eq. (3.73) becomes 

_ k 2 $ = ^Ga 2 5p (3.78) 

being identical to the Poisson equation (see Eq. 2.12) derived by Newtonian fluid dynamics, if $ is in- 
terpreted as the perturbation of the Newtonian gravitational potential. Thus $ has a simple physical 
interpretation and, well within the horizon, the Newtonian gauge reduces to Newtonian mechanics. 
We will call <3? the "generalized Newtonian potential". 13 Another advantage of the Newtonian gauge 
is that the metric tensor g^ v is diagonal which makes analytic calculation convenient. 

Eq. (3.76) yields a simple algebraic formula connecting ^ and <& by the anisotropic stress II. During 
the matter dominated era in which independent, ideal fluids (DM and baryonic matter) dominate 
the energy density, we may neglect the anisotropic stress. With this approximation we obtain \f ~ 3> 
and there remains only a single parameter in metric perturbation being the generalized Newtonian 
potential <3?. 



Comoving gauge 

The comoving gauge is defined by 

B + v = 0, H = 0. (3.81) 

Using Eqs. (3.68) and setting B + v = and H = we see that the transformation from a general 
gauge into comoving gauge is given by 

T=±(B + v), L = ~\- ( 3 - 82 ) 

This transformation is uniquely determined, so the comoving gauge is entirely fixed. We rename the 
remaining metric perturbations as 

£ = A, C = H L . (3.83) 

There is a relation between £ and the 3-curvature perturbation ^51Z on a slice of constant comoving 
time r. In a general gauge the 3-scalar curvature perturbation is given by 14 

V*6n = 4^} (h l + ±h\ , (3.84) 



so it reduces in the comoving gauge to 



^5K = £( • (3.85) 
a z 



As is shown in Section 4.2, the 3-scalar curvature perturbation is a useful quantity, since, under 
certain conditions, it stays constant outside the horizon if multiplied by a 2 . 



13 Note that the gauge invariant Bardeen potentials defined by Bardeen (1980) 

*A = A-±B-±HB-±(H + HH) , <$>n = H L + \H-\uB-^UH, (3.79) 
reduce in the Newtonian gauge to 

* = $a , $ = -$h • (3.80) 

This means that the simple physical interpretation of $ is automatically conveyed to — $h in all gauges, since <&h 
is gauge invariant. 

14 This is easily seen by using the expression for the 4-scalar curvature perturbation in the Appendix D of Kodama & 
Sasaki (1984) and setting all terms containing A or a time derivative to zero. 
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3.5. Evolution of the perturbations 

In this section we try to find solutions to the field equations and equations of motion from the 
previous section. After the electron-positron annihilation at a temperature of T ~ 10 9 , the universe 
contained only four different components: cold DM, baryons, photons, and neutrinos. We refer to 
these components with the subscripts d, b, 7, and v, respectively. While cold DM and neutrinos 
interacted with all components only by means of gravitation, the baryons and the photons were 
tightly coupled to each other until the epoch of decoupling. 15 Again we will only consider scalar 
perturbations and thus drop the (0) superscripts for ease of notation. 

3.5.1. Dark matter and baryons 

Since cold DM interacts with the baryons, photons, and neutrinos only by means of gravity, the 
conservation equation (3.52) is satisfied separately for the DM fluid pd]^, i.e. V^T^]^ = 0. In the 
Newtonian gauge, with p d = U d = for DM, the equations of motion (3.77) become 

(d T + m) 5p d = p d (3$ - kv d ) , (3.86) 
{d T + m)p d v A = kp d ^ . (3.87) 

Photons, neutrino, and baryons affect the evolution of the DM only by means of \E r and <£. By 
introducing the DM overdensity 

S d = *M (3.88) 

Pd 



Eqs. (3.86) and (3.87) simply become 



S d = -kv d + 3$ , (3.89) 
v d = -Uv d + W> . (3.90) 



So by differentiating Eq. (3.89) and replacing v d by Eq. (3.90) we can eliminate v d and obtain a 
relation between 5 d and the metric perturbations: 

S d + HS d = -fc 2 ^ + 3^6 + 3$. (3.91) 

In order to solve this equation, we have to know the metric perturbations VP and <!>, which depend 
on the total energy budget of the universe. However, in the matter dominated era we can assume 
T^ v ~ T£" , and so we have \t ~ due to IT ~ lid = (see Eq. (3.76)). Then for modes well inside 
the horizon we can replace k 2 & by the Poisson equation (3.78) yielding 

S d + U5 d = 47rGa 2 p d 5 d + 3^$ + 3$ . (3.92) 

If we can further ignore the time derivatives of we obtain 



6 d + H5 d - 4:TrGa 2 p d 5 d = , 



(3.93) 



which is equivalent to the evolution equation of DM in the Newtonian regime (see Eq. (2.20)). 
However, neglecting the time derivatives of $ is only acceptable, if the solutions lead to more or less 
time independent During matter domination, it holds a oc r 2 and so the growing and decaying 
modes of S d are given by 5 d oc r 2 and 5 d oc r~ 2 , respectively. Thus according to the Poisson equation 



J So Eq. (1.24) is not satisfied separately for photons and baryons during this time. 
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(3.78) the growing mode leads indeed to a constant $ inside the horizon during matter domination. 
This justifies Eq. (3.93) a posteriori. 

While Eq. (3.91) was derived for cold DM, it applies for baryons as well after the epoch of decoupling 
and on scales larger than the Jeans mass Mj (see Sect. 2.1), i.e. 

S h + nS h = -k 2 ^ + m$ + 3$ . (3.94) 

Subtracting Eq. (3.91) from Eq. (3.94), we obtain 




(3.95) 

The growing solution for the difference 5d — is a constant, so the difference between cold DM and 
baryon density perturbation does not change with time. Thus during matter domination it holds 

— z = 1 - — oc r . (3.96) 

This means that 5^ catches up with 5^ after decoupling and grows at an equal rate. 

This result can be summarized in simple terms as follows. During matter domination but before 
recombination (zd cc — 1089), the baryons are tightly coupled to the photons and thus are prevented 
from growing due to the photon pressure, while the DM perturbations grow unimpeded. After 
recombination, the photons basically propagate freely and the baryons start to feel the gravitational 
pull of the DM structures that have evolved in the meantime. However, since the photon-to-baryon 
ratio in the universe is huge, the residual ionization of cosmic gas keeps the temperature of the 
baryons close to the temperature of the CMB and the radiation drag prevents the baryons to fall 
into the DM potential wells (see e.g. Sect. 17.3 of Peacock 1999, Naoz & Barkana 2005). It is only 
for z < 100 that the baryons entirely decoupled from the photons and catched up with the DM 
structures. This is shown in Figure 3.1, where calculations from Naoz & Barkana (2005) for the 
power spectra of DM and the baryon density are shown at different epochs using an extension of the 
CMBfast code (see the following section). While for scales outside the horizon (k < 0.01 at z = 1000) 
both power spectra are equal, we see inside the horizon the result of the photon-baryon plasma before 
recombination in form of acoustic oszillations. The figure nicely shows how the baryons slowly catch 
up by falling into the DM structures over the timespan of z = 1200 to z = 200. 

This scenario is also a strong indication for the existence of some sort of non-baryonic DM, since 
the DM perturbations help the baryonic perturbations to grow. We saw in Section 2.1.3 that during 
matter domination the perturbations grow proportionally to the scale factor a. In fact, it can be 
shown (Lifshitz 1946) that in the general relativistic framework perturbations can maximally grow 
proportionally to the cosmic time t. However, since the baryonic perturbations were prevented from 
growing at least until the epoch of decoupling, which took place around ^dcc — 1089, and since the 
perturbations at that time were of the order of 10 -5 , they could in a baryon-only scenario (without 
any DM) maximally grow about a factor of ao/a(td cc ) = (zdcc + 1) ~ 1000 until the present epoch and 
would be of the order 10~ 2 today. Obviously, this is not enough to reach the nonlinear regime and to 
form the galaxies and clusters we observe today. On the other hand, since the DM perturbations were 
not coupled to the radiation, they could start to grow much earlier and thus enable us to reconcile 
the small perturbations at the epoch of decoupling as observed in the CMB with the nonlinear LSS 
today. 
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Figure 3.1. Power spectra (in dimensionless form k?V) of the density fluctuations of DM 
and baryons at different redshifts. The solid curve corresponds to DM perturbations and the 
dotted curve to baryon perturbations. The horizon scale at z = 1000 is about k ~ 0.01. 
Inside the horizon the baryonic acoustic oscillations in the power spectrum of the baryons are 
clearly visible and it is shown how over the timespan of z = 1200 to z = 200 the baryon power 
spectrum slowly catches up with that of DM, as the baryons fall into the gravitational potential 
wells of previously formed DM structures. (Adapted from Naoz & Barkana 2005) 



3.5.2. The complete treatment 

The results in the last section are only approximately valid under certain conditions and during 
certain cosmological eras. To obtain an accurate model for the transfer function T(k) of DM (see 
Sect. 2.1.4) at the present epoch or of the CMB multipoles Q, it is unavoidable to numerically solve 
the complete set of equations taking into account all existing components of the universe (cold DM, 
baryons, photons, and neutrinos). The derivation of this set of equations would require to go beyond 
the simple fluid approximation for T^, since it does not contain effects like "damping of oscillations" 
or "free streaming". Thus to accurately describe the evolution of photon and neutrino perturbations, 
one has to apply the general relativistic Boltzmann equation, i.e. 

" r >V^) / = C[f] , (3.97) 

to the 1-particle distribution functions f(x,p) of photons and neutrinos, where = (p°,p) is the 
momentum and C[f] the collision term, which accounts for Thomson scattering between electrons 
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and photons and vanishes for neutrinos. The energy- momentum tensor for a particle of mass m is 
then given by (cf. Eq. (1.28)) 

T^ix) = [ f(x,p) V^g&dp 3 , (3.98) 

JPm{x) P 

where P m (x) is the configuration space of particles with mass m at position x, i.e. all p 11 with 
9fiu(x)p tJ 'p u = —m 2 , and g is the determinant of g^ v . Since Thomson scattering can produce a net 
polarization of the photons, the complete analysis has to take into account polarization perturbations 
as well. The derivation of the full set of equations from first principles is given, for instance, in 
Weinberg (2008) in a self-contained way. We refer the interested reader to this comprehensive review 
for more details. 

After having derived the complete set of equations, the remaining challenge is to find a solution, 
which can only be achieved numerically. But even finding a numerical solution is extremely challeng- 
ing. Since the 1-particle distribution function f(x,p) is not only a function of spacetime x, but also 
of direction p, it is usually expanded into Legendre polynomials Pi(n), i.e. 16 

oo 

f(T,k,ri = Y,i- l (U + l)Pl(v)fl(T,k), fi = kp. (3.99) 
1=0 

The corresponding Boltzmann equation is then converted into a "Boltzmann hierarchy" using the 
relation 

(21 + 1) pPifr) = (l + l) + IPi-i(p) (3.100) 

and the orthogonality of the Legendre polynomials. This is a set of coupled differential equations 
where for each I there is an equation which couples the moments fi + \(r,k), fi(r,k), and fi-i(r, k). 
To compute the perturbations on scales appropriate for current CMB observations, one needs all 
Legendre moments fi up to I ~ 1000. Thus one has to solve a system of about 3000 coupled dif- 
ferential equations: 1000 for photon perturbations, 1000 for photon polarization, 1000 for neutrino 
perturbations (e.g. Ma & Bertschinger 1995, Seljak & Zaldarriaga 1996). Furthermore, this system 
of equations has to be solved for every Fourier mode k, and since the solutions are rapidly oscil- 
lating functions of time, the integration has to proceed in small time steps. This demands a lot of 
computation time even on present day computers. 

Fortunately, Seljak & Zaldarriaga (1996) found a way to compute all these Legendre moments 
without solving this huge system of differential equations. Following their method, one only has to 
solve this system for I < 20 and all the higher moments are then obtained by means of a line-of-sight 
integral. This is the heart of the CMBfast code (Seljak & Zaldarriaga 1996). The computation 
with CMBfast was about two orders of magnitude faster than the standard Boltzmann methods, 
while preserving the same accuracy which was about l%-2% at that time. In the meantime, the code 
has been continuously extended. The initial code was developed for scalar quantities in flat FLRW 
background cosmologies, but now it includes open and closed background cosmologies, vector and 
tensor modes, weak lensing etc. Today, for parameter sets around the present concordance cosmology, 

16 Since in both, the collisionless Boltzmann equation and the collision term C[f] for Thomson scattering, the direc- 
tional dependence enters only by means of the scalar product k ■ p = fi, the 1-particle distribution functions for 
photons and neutrinos also depend only on /i, if this was initially the case. So we may assume here that the initial 
momentum dependence is in fact axially symmetric. Thus the Legendre polynomials Pi(fJ.) constitute a complete set 
for representing f(r, fc, n) for each fc and r. Furthermore, since the field equations (3.49) and equations of motions 
(3.52) are linear differential equations depending only on fc = |fe|, two modes fc and fc' with |fc| = |fc'| obey the 
same time evolution up to the amplitude of the initial conditions. Thus it suffices to solve the field equations and 
equations of motion for each fc, i.e. we can write /(r, fc, £t). 
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the accuracy of the CMBfast code is about 0.1% for the CMB multipoles C[ up to I = 3000 and 
even better for the DM transfer function T(k) (Seljak et al. 2003). Today the CMBfast-package is 
not suported anymore, but there are other publicly available CMB-computation-packages, such as 
CAMB (Lewis et al. 2000) and CMBeasy (Doran 2005), which are based on CMBfast. 17 



17 For further information about these codes we refer to the corresponding websites http://camb.info/ and 
http: / / www.cmbeasy.org respectively. 
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Chapter 




Generation of primordial perturbations 



The generation of perturbations in the very early universe is important insofar as it produces the 
initial conditions for the theories of structure formation as described in the Chapters 2 and 3. The 
basic idea is that during inflation quantum perturbations are created and stretched to scales outside 
the horizon, where they are conserved until they reenter the horizon and become observable (see 
Sect. 1.6.3, where this process is described in simple terms). We will consider only the simplest 
inflationary scenarios which are driven by a single scalar field <f> (the "inflaton" ) and obey the "slow 
roll" conditions (see Eq. (1.81)). 

In this chapter we develop the basic theory behind the generation of perturbations in the very 
early universe. In Section 4.1 we quantize the initial perturbation during inflation by means of 
the canonical quantization, then in Section 4.2 we derive the behavior of perturbations outside the 
horizon, and finally in Section 4.3 we derive the primordial DM power spectrum for scales inside the 
horizon. 

Since for modes outside the horizon we are in the relativistic regime, this chapter is based on the 
general relativistic theory of linear perturbations, which was described in the previous chapter. The 
quantization of the perturbations is performed in Newtonian gauge, the constancy of perturbations 
outside the horizon is shown for comoving gauge, and finally for the power spectrum within the 
horizon we transform back to Newtonian gauge. Since we have to deal with perturbed scalar fields 
4>, we also need certain relations from Appendix A. 2. 2. Readers that are not familiar with the theory 
of scalar fields in the context of general relativity are encouraged to first study the full Appendix A. 
Also some general knowledge on "canonical quantization" of a scalar field in Minkoswski spacetime 
is required and we refer the reader to textbooks of quantum field theory for an introduction (see e.g. 
Mandl & Shaw 1993). 

Like in Chapter 3 the background FLRW universe is assumed to be flat and the time coordinate 
is always conformal time r (see Eq. (1.8)). The Friedmann equations (1.19) and (1.20) are then 

U 2 = ^Gpa\ n = ~aHp + 3p), (4.1) 

where % = a/ a is the conformal Hubble parameter. We consider only scalar perturbations so that 
the superscript (0) can be omitted in the following. Units are chosen such that c = H = 1. 

4.1. Quantization of perturbations 

To quantize the initial perturbations in the universe, we need basic concepts from quantum field 
theory. In general, the quantization of a system in curved spacetime is rather complicated and the 
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meaning of a particle or even of the vacuum is slightly obscured (see Birrell h Davies (1982) for a 
general discussion). Fortunately, the Lagrangian for the scalar field perturbations reduce to those of 
a scalar field in Minkowski spacetime (up to an effective time dependent mass). This simplifies our 
treatment a lot. 



4.1.1. Classical equation of motion 

The proper quantization of a system has to start from its action S. It is dangerous to simply compute 
the classical equation of motion and try to interpret it in the context of quantum field theory. This 
could lead to a wrong normalization and thus to an incorrect result as demonstrated by Deruelle 
et al. (1992). The action for the field 4> i n Newtonian gauge is given by Eqs. (A.1)-(A.4): 

5* = j^C^-gdx A = jf (jj^K - \d^d^ - V(4)j ^g dx 4 . (4.2) 

In order to find the action for the field perturbation 5(j), the action (4.2) needs to be expanded to 
second order in perturbations. This is a straightforward, but lengthy calculation and will not be 
reproduced here. The result is (Mukhanov et al. 1992, Sect. 10.3) 

S q = J C q dx 4 = i J (^-v a ^d a q dpq + -J 2 + total derivatives^) dx 4 , (4.3) 

where we have introduced the notation 

q(x)=5cP + h, z ^) = a ^- ( 44 ) 

The action (4.3) effectively describes a Klein-Gordon field v with time-dependent mass m 2 (r) = — z/z 
in the Minkowski spacetime n a p = diag(— 1, +1, +1, +1) (cf. Eq. (A. 11)). This becomes obvious by 
deriving the equation of motion for q. Varying Eq. (4.3) with respect to q yields the classical equation 
of motion by means of the Euler-Lagrange equation (A. 10) 

- ^d a dpq -- q = q-Aq--q = 0. (4.5) 

z z 

The total derivatives vanish due to the usual condition 5q = on dV. 

Before quantizing the system, we want to find analytic solutions for Eq. (4.5). Such solutions 
are found by introducing the following approximations. Note that during slow roll inflation (p/H is 
approximately constant and so it holds to first order 



q~ %= (n+n 2 ^q^2n 2 q + 0(n 2 eq) , (4.6) 



where e is the slow roll parameter (see Eq. (1.83)). If we neglect the term of order O (% 2 eq), the 
equation of motion becomes 

q - (A + 2V 2 ) q = 0. (4.7) 

This can be further simplified by measuring our time with respect to the end of inflation Tf . That is 
we introduce a new time variable f such that 
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where we have assumed that the Hubble parameter H = %/a is approximately constant during 
inflation and the scale parameter at the end of inflation ctf is much larger than at the time r, 
i.e. a(r)/of <C 1. This means that r and f are identical up to a constant shift. For ease of notation 
we will denote this new time also by r. The approximate equation of motion (4.7) then becomes in 
Fourier space 

q k +{k 2 -^jq k = Q, (4.9) 

which allows analytical solutions. Since it is a linear differential equation of second order, it has two 
independent solutions for each k mode, and since with qk(r) also q%(r) is a solution, the solutions 

along with 9j1(t) for all k constitute a complete set of independent solutions to Eq. (4.9). 

Note that the relation (4.8) allows us to express the comoving Hubble length (see Sect. 1.4.3) as 
\/% = — t. So the condition for a comoving Fourier mode k to cross the horizon is— kr = k/% = I. 



(4.10) 



4.1.2. Canonical quantization 

As already mentioned, the action (4.3) describes a Klein-Gordon field q with time-dependent mass 
in Minkowski spacetime. Since we can quantize this field as in standard quantum field theory (up to 
the time dependent mass), we will quantize q directly rather than Sep. 1 

The first step of the canonical quantization is determining the canonical conjugate momentum field 
to q defined by 

tt(x) ee ^ = q(x) , (4.11) 

and interpret these variable as operators 

q(x) ->• q(x) , tt(x) ->• vr(x) = q(x) , 

subject to the equal-time commutation relations 

[q(r, x),tt(t, x')] =iS(x- x') , [q(r, x),q(r, x')] = [vr(r, x),tt(t, x')] = . (4.13) 

Since is a real scalar field, the operators q and 7r are hermitian and we can expand them into 
Fourier integrals in the following way 

1 This is no loss of generality, since for a single scalar field <f> the system has only one degree of freedom. That is the 
metric perturbations are determined as soon as S(j> is determined and vice versa. So once we have quantized q all the 
other perturbations such as 5(f) and the metric perturbations follow through the constraints which relate them to q. 
As a consequence there are no scalar metric perturbations without a scalar field. The same holds for metric vector 
perturbations, i.e. there are no metric vector perturbations present without a vector source. Since a scalar field 
does not exhibit any vector perturbations (see App. A. 2. 2), no vector perturbations are generated during inflation. 
However, this is not true for tensor perturbations ("gravitational waves"). Tensor perturbations have their own 
degrees of freedom which might get excited during inflation even without tensor sources. This is why gravitational 
waves but no vector perturbations can be generated in inflationary scenarios. 



(4.12) 



(4.14) 
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where the functions q k (r) are the complete set of solutions to the equation of motion given by 
Eq. (4.10). Note that these functions are normalized such that it follows with the commutation 
relations (4.13) for the operators d k and a) k 

[a*, 4'] = ( 27r ) 3 5 ( k - ' [ fi *> = [°L "Li = 

as required by the canonical quantization. Finally, we define the vacuum state |0) by 2 

a k \0) = 

for all k. 

4.1.3. Expectation values 

Now we will investigate the expectation values of the perturbations. The simplest and most natural 
assumption is that the state of the universe during inflation is the vacuum |0). 3 In this state, the 
expectation value fi q (x) of the field q(x) is 

/i ,(x) = <0|g(r,x)|0> = ^3 / (?fc(r)<0|a fc |0)e <feB + <£(r)(o|4 \o)e^ kx ) dk 3 = (4.17) 

and the correlation function £ q (r,x,x') of q(x) is given by (cf. Eq. (2.28)) 
£ q (T,x,x') = (0|g(r,aj) g(r,aj')|0> 

= / 1 ^(r) q* k ,(T) (o\d k al\Q) e -C— dk 3 dk 

= (2^)3 / ^ Hx '- X) dk\ 

where we have used the relation 

(0\a k a[,|0> = (0|4 a k ,\0) + (2vr) 3 S(k - k') = (2tt) 3 5(k - k') 

Thus the power spectrum (see Eq. (2.29)) can be directly read off from Eq. (4.19) as 

Qk(r) q* k ,(r) (0\d k al\0) = (2vr) 3 <5(fc - fc')|g fe (r)| 2 = (2vr) 3 6(k - k')V q (r , k) , (4.22) 

where V q (r, k) is rotational invariant due to the rotational invariance of q k and different k modes 
are decoupled due to the canonical commutation relations (4.15). The latter property of the power 
spectrum makes the correlation function to depend only on the difference x — x', and the former 
property leads to rotational invariance for the correlation function. That is it holds in fact £<j(t, \x — 
x'\) = £ q (T,x,x') as was assumed in Section 2.2 as a consequence of the cosmological principle. 

2 As discussed in Chapter 11 of Mukhanov et al. (1992), the vacuum |0) changes for different times r due to the time 
dependence of the effective mass m 2 (r) = —z/z. This is a common feature of the vacuum in curved space (see 
Birrell & Davies 1982). However, at least for modes well inside the horizon, i.e. H/k <JC 1, the short-wavelength 
part of the initial vacuum spectrum should be independent of the choice of the vacuum. This condition is satisfied 
to high precision if inflation lasts long enough. So there is no problem computing the power spectrum of initial 
perturbations as long as a considered perturbation was initially well inside the horizon. 

3 This is not the only possibility as discussed in Section 10.1 of Weinberg (2008). A reason to expect the system to be 
in the the vacuum state during inflation is that the vacuum is the energetically lowest state and so any other state 
should decay into the vacuum. However, it is not clear whether this would happen fast enough. 



(4.15) 



(4.16) 



(4.18) 
(4.19) 

(4.20) 



(4.21) 
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Starting with a perturbation k well inside the horizon so that its vacuum |0) is well defined, and 
waiting until the mode is well outside the horizon, i.e. kr <C 1, we have 

*WS(r) - I (l + ^) = ^ = % (423) 
and thus the power spectrum well outside the horizon is 



(4.24) 

4.1.4. Gaussianity 

What is the probability distribution function of the field q(x) in the ground state |0)? To answer 
this question, we express the operator q(r, x) in terms of its Fourier transformed operator q(r, k) as 

«,->-^/(^ + « W 4*-)* 

= (2^ / ( 9fc(T) hk + qUT) a "* ) e *" dk " (425) 
= (2^3/^ (T ' fc)e ^ dfc3 - 

Thus it holds 

q(r, k) = q k a k + q* k a)_ k , vr(r, fc) = (?(t, fc) = q k a k + ^a^ fc . (4.26) 
We can now introduce four hermitian operators 

qRc(T,k) = q k a k +q* k a}_ k +q* k a k + q k a_ k , tt Rc (t, k) = q Rc (r, k) , 

+ + • v '/ 

«m(r, k) = q k a k + <?fcal_ fc - q* k a ] k - q k a- k , Thm(r, k) = qi m {r, k) , 

such that 

q{r,k) = q Rc (r,k) + iqi m (r,k) , tt(t, k) = ^R e (r, k) + i 7Ti m (r, k) . (4.28) 

These are the real and imaginary parts of q(r, k) and 7r(r, k), respectively. Their equal time commu- 
tation relations are as follows: 

[q Re (r,k),n Re (r,k')] = ^(2vr) 3 [S(k - k') + 8{k + k')] , 

1 (4.29) 
[q lm (r, k), 7r Im (r, k')] = ^(2vr) 3 [S(k - k') - S(k + k')] , 

while all other pairwise commutators vanish. 

The non- vanishing commutation relations between modes with k and — k are due to the reality of 
the field q(x) leading to q(r,k) = q*(r,—k). Hence with q(r,k) also q*(r, — k) is fully determined 
and so they must exhibit the same commutation relations with respect to tt(t, k) and tt*(t, — k) 
respectively. However, if we restrict ourselves to the upper half of the Fourier space, i.e. k z > 0, the 
real as well as the imaginary parts of q(r, k) are simultaneously measurable and behave (together 
with their canonical momenta) like independent harmonic oscillators (up to the factor 1/2). So 
analog to the simple harmonic oscillator in quantum mechanics, the probability density functions of 
the real and imaginary parts of q(r, k) are independent Gaussian distributions for every k with the 
constraint q(r, k) = q*(r, —k). 
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4.1.5. Transition from quantum perturbations to classical perturbations 

So far we treated the universe entirely quantum mechanical. But when and how does the transition 
to the classical universe that we can observe today take place? For modes well outside the horizon, 
i.e. kr <C 1, the Fourier transforms (4.26) become 

g(r, k) C --L 1- (a fce -*r _ a t_ fcC «r) > #(Tj fc) ^ y| _L_ ( afce -*r _ a t_ fc( ,ttr) . (4 . 30) 

That is g(r, fe) and 7r(r, fe) are proportional to the same operator and thus they started to commute. 
This means that the system behaves classical outside the horizon. 

This simple consideration, however, does not explain how a particular realization — our universe — 
is chosen out of the quantum ensemble during this process (see Mukhanov 2005 for a brief discussion) . 
The way this happens lies at the very base of quantum mechanics and is not yet fully understood in 
general. Here we will just assume that outside the horizon the field q became classical and also the 
power spectrum (4.24) can be interpreted as a power spectrum of an ensemble of classical universes. 

Thus we can summarize this section with the statement that during inflation quantum mechanical 
processes produce tiny fluctuations in the universe, which become classical as they leave the horizon 
and can be regarded a realization of a Gaussian random field with zero mean and with a translational 
and rotational invariant correlation function. 

4.2. Conservation of perturbations outside the horizon 

One of the key points in the context of inflation is the behavior of perturbations outside the horizon. 
We will show that in comoving gauge the 3-curvature perturbation ^8TZ times a 2 (or equivalently 
the perturbation £, see Eq. (3.85)) is conserved outside the horizon if perturbations are adiabatic. 
Without this feature, it would be basically impossible to make any robust prediction from inflation, 
since we practically know nothing about the fundamental physics associated with inflation and the 
transition from inflation to the radiation dominated universe. 

4.2.1. Adiabaticity 

After the inflationary slow-roll stage, the inflaton (f> decays by a process that is not very well un- 
derstood and the unverse becomes radiation dominated. What can we say about the perturbations 
of the decay products of the inflaton? To answer this question we consider a scale k well outside 
the horizon and smooth the universe on this scale at every point. Since k is well outside the hori- 
zon, each smoothed patch is causally disconnected from any other smothed patch and evolves like 
a homogeneous and isotropic universe. These "universes" have slightly different mean densities and 
become identical if we synchronize them on slices of constant <f>. Since the inflaton is the only field, 
such a synchronized universe would be absolutely homogeneous. So whatever happens after inflation, 
it happens everywhere the same, and any decay product of (j> will be homogeneous as well. Then 
transforming back to the old or any other time variable yields (see Eq. (3.63)) 

Ss(T,k) = -s(r)T(T,k) (4.31) 

with the same T(r, k) for any 4-scalar s(r, k). That is, we find for any energy contribution I 
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(4.32) 
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in any gauge as long as the scale k is outside the horizon. This is the generalized adiabatic 
condition. Moreover, with Eq. (4.31) it also follows 



(4.33) 



Sp 5p 
P P 

which is important for the following reason. Since it holds for the pressure 

p(r, k) = p{p + 5p,S + SS) =p + 5p (4.34) 
with S = S + 5S the entropy of the fluid, we can express 5p to first order as 

^ = (!),■ ° = (4 - 35) 

where c s is the speed of sound. So with Eq. (4.33) it follows 



dp p dp ( dp 



8p p dp \dpJs 



c 2 s , (4.36) 



where we have used that the entropy is conserved in a FLRW universe. 4 Thus with Eq. (4.36) we 
have for the entropy perturbations 



SS = - (Sp - c 2 s 5p) = . 



(4.39) 



This is why these perturbations are called adiabatic. The adiabaticity of the perturbation is important 
for the conservation of perturbations outside the horizon as is shown in the next section. 

4.2.2. C outside the horizon 

In this section we show that in comoving gauge £ is conserved well outside the horizon. As we 
deal with different gauges, we denote quantities in the Newtonian gauge with a superscript "N" and 
in the comoving gauge with a superscript "com". Using Eq. (3.82) the gauge transformation from 
Newtonian gauge to comoving gauge is 



V N 



k 

So we can express 8p com in terms of Sp N as 



. (4.40) 



„N N 

<5p com = J P N - = Sp N + m(p + p) — . (4.41) 



4 The conservation of entropy in a FLRW universe is easily seen. Let V be a comoving volume, so that its corresponding 
proper volume is V pl = a 3 V. The equation of motion Eq. (1.23) can be written as d(pa 3 ) / d(a s ) = —p. The change 
of the internal energy dU in this comoving volume during the expansion of the universe is then related to the change 
of the proper volume dV pr by 

dU = d(pa 3 V) = -p d(a 3 V) = -p dV pl . (4.37) 
Thus, during the expansion of the universe, the change of entropy in this comoving volume is 

ds = dU + p dV pi - Ej fudNt = E, tMdNj (43g) 

where Ni is the number of particles of species i in this volume and \jh the corresponding chemical potential. If 
Hi ~ 0, i.e. negligible particle-antiparticle asymmetries, it follows dS = within any comoving volume V. 
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With the density equation (3.49) in the two gauges this becomes 



_ k 2 ® = 4irGa 2 5p N + 37i (p + p) — = AitGa z 5p com = k\ + Uk v 

K 



(4.42) 



For slow roll inflation with a single scalar field, the perturbations are adiabatic, i.e. Sp com = c 2 5p com 
(see Eq. (4.35) for 5S = 0). So it follows 



-U 



5p c 



P + P 



c 2 s ns P co 
p+p 



c 2 s k 2 U<5> _ 2 fc s k 



P + P 



H J p + p 



(4.43) 



where in the first and the second step we have used the momentum and the Euler equation (3.49) 
in comoving gauge, in the third step equation (4.42), and in the last step the Friedmann equation 
(4.1). Thus |C| is of order 




(4.44) 



and can be neglected for k/H < 1. So ( is roughly constant outside the horizon if the perturbations 
are adiabatic. 5 

4.2.3. $ outside the horizon 

There is also a similar theorem for the generalized Newtonian potential as for (. Since L = for 
a transformation from Newtonian to comoving gauge, it follows from Eq. (3.69) immediately that 

..corn _,N 



v . Using the transformation from Newtonian gauge to comoving for the perturbation H we 



have 



k 



(4.45) 



where we have used Eq. (4.40). Then using Eq. (3.74) we can connect the Newtonian perturbations 
and $ with £ by means of 



JN q IN 

^> + nm = ^Ga 2V —{p + p) = ^u 2V —{i + w) 



-H (1 + w)(( + $) , 



(4.46) 



where in the second step we have used the Friedmann equation (4.1) and in the last step Eq. (4.45). 
Solving this for £ yields the following relation for any equation of state w: 



C = -<£- 



2 * + 

3 1 + w 



(4.47) 



5 Weinberg (2008) gives in his Section 5.4 a proof of a more general theorem stating that whatever the contents of the 
universe are, there are always two independent adiabatic physical scalar solutions for which £ is time independent 
outside the horizon. Thus if cosmological fluctuations are described by such a solution during inflation, £ will also 
stay constant after inflation since this is always a solution. In fact, having this theorem the adiabaticity of the 
perturbations and constancy of £ follows immediately for slow-roll inflation with only one inflaton. Since for such 
an inflationary scenario there is only one degree of freedom and since the equation of motions are second order 
differential equations, there are only two independent solutions and these must be those mentioned by the theorem, 
for there are always two such solutions. However, if inflation is governed by more than one inflaton field, this 
reasoning is no longer possible and there might by entropy perturbations produced during inflation. 
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Ignoring the anisotropic stress II, i.e. $> = and allowing only adiabatic perturbations, i.e. 5p N 
<$5fF, we obtain with Eqs. (3.73) and (3.75) 



$ + m§ + (VH + U 2 ^§ = 47rGa 2 5p N = 4irGa 2 c 2 5p N 
arriving at the following differential equation for <&: 



A- 2 * + \YH (<£> ■+ H<I> 



4> + 3^6 (1 + c 2 s ) + [2?i£ + U 2 (1 + 3c 2 ) 



$ + c 2 k 2 $ = . 



(4.48) 



(4.49) 



For a constant equation of state w(t) = w we can find an analytic solution to this equation for scales 
well outside the horizon. Uw is constant, it holds (see the Eqs. (4.36), (1.25), (1.33), and (1.34)) 6 



c 2 = ^ = w , a(r) oc T 2 ^ 1+ ^ 
s dp 



H(t) 



l + 3w 



so that Eq. (4.49) reduces to 



(4.50) 



(4.51) 



1 + 3w T 

For scales well outside the horizon kr oc k/H <C 1 we can neglect the last term and the general 
solution in this limit is then given by 



$(t) = ci + c 2 r 17 , 



6(1 + w) 5 + 3u> 
- = 4 ^ - 1 = — > 1 , 



l + 3w 



l + 3w 



(4.52) 



where c\ and c 2 are two constants. Thus for a constant equation of state, the growing mode of $(r) 
outside the horizon is just a constant, i.e. 

$ ~ const . (4.53) 
Thus in this case the relation (4.47) between £ and <5 simplifies to the constant expression 




(4.54) 



4.3. Primordial power spectrum 

After the end of inflation, the universe finally turned into the radiation and then matter dominated 
eras, during which the universe is decelerating. So the modes that left the horizon during inflation 
will reenter it after a certain time (see Fig. 1.3). In the previous section we studied the behavior 
of the perturbation when they are outside the horizon. In this section we use these results along 
with the power spectrum (4.24) that was created during inflation to compute the explicit form of the 
primordial DM power spectrum inside the horizon for modes that enter the horizon during matter 
domination. The power spectrum for modes that enter during radiation domination can then be 
obtained by using the transfer function T(k) (see Sect. 2.1.4). 



"Note that we are changing again the zero point of the conformal time coordinate. Moreover, we assume here that the 
timespan when the universe was not dominated by the fluid with the constant equation of state w was short enough 
compared to the time r, so that using Eq. (1.33) along with (1.34) instead of Eq (1.37) is a good approximation. 
This approximation is acceptable for the radiation dominated epoch as well as the matter dominated epoch (see 
Eqs. (1.73) and (see Eqs. (1.72)). Also note that the equations of Chapter 1 are formulated by means of cosmic 
time t instead of conformal time r. 
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4.3.1. Scale invariant power spectrum 

Due to the constancy of ( outside the horizon, we can evolve the power spectrum (4.24) until the 
time when a given mode k reenters the horizon. To express the power spectrum in terms of £ we 
need a relation between £ and the variable q that was quantized during inflation, which is obtained 
by transforming the right hand side of Eq. (4.4), i.e. 

q = 5<f> N + ^ $ , (4.55) 
from Newtonian gauge to comoving gauge. Using Eq. (3.63) and Eq. (3.82) with B = yields 



<5</> com = 5<j> N - I v N (4.56) 



and so with Eq. (4.42) we have 



9 = ^ N + ^ = ^ com -^C- (4.57) 



Using Eq. (A. 37) it holds in comoving gauge 

$ 5<j) com = (4.58) 

and thus 8<jf OUi = 0, since (f) is generally a nonzero function determined by the background cosmology. 7 
So we obtain the relation between v N and £ as 

C=¥(5<T m -</) = -¥<?■ (4.59) 

The power spectrum well outside the horizon is then given by constant expression 

(*(*)**(*')> = C 2 (C(fc)C*(fc')> = C 2 (J \ (q(k)q-(k')} (4m) 

-CW'<* -*)(?)' \^) , (4.61, 

where in the first step we have used Eq. (4.54) for a constant w and in the last step power spectrum 
at horizon exit given by the Eqs. (4.22) and (4.24). The constant C is —2/3 during radiation 
domination and —3/5 during matter domination. Since H/<fi and Ti/a? are roughly constant during 
slow-roll inflation, we can evaluate the right-hand-side of Eq. (4.61) for each mode at the time r ou t(fc) 
when it leaves the horizon. 

The power spectrum of DM is defined by (Sd{k)5^(k')}, where 5d(k) = 5pd/pa and pd = pd + $Pd 
is the matter density of DM. If the universe is dominated by DM, i.e. p ~ pd, then we can relate 3d 
and <1> well inside the horizon by means of the Poisson equation (3.78) 

_ k 2 § = ^Ga 2 5 PA = 4TTGa 2 p d 5 d = ^HS d , (4.62) 



7 Only for a constant potential V{<j>) is a constant <j> a solution to the equation of motion (A. 27). This case is not 
considered here. During slow roll inflation cj> is only approximately constant and so <50 com must be exactly zero to 
satisfy the relation (4.58). 
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where in the last step we have used the Friedmann equation (4.1). With this relation and with 
Eq. (4.61) we get for the power spectrum inside the horizon 



(S d (k)8* d (k')) = (2vr) 3 5(fc - fc') 



2\ V k\ 2 ( H 4 



5/ \HJ \20 2 k 3 a 2 



(4.63) 

Tout (fc) 



Note that to properly compute the DM power spectrum inside the horizon, we would have to evolve 
$ from outside into the horizon. But since during matter domination is also constant inside the 
horizon (see Sect. 3.5.1) we will accept the approximation (4.63). Moreover, for simplicity we will 
evaluate the term (k/T-L) 2 at horizon entry, so that it just becomes unity for all k. The DM power 
spectrum within the horizon then becomes at horizon entry T m (k) for each mode 

(S d (k)S* d (k')) = (27r) 3 5(k - k')V d (T, k) , V d (r in (k), k) oc ^ , (4.64) 

since during slow roll inflation H/<j> and T-L/a 2 are roughly constant and thus are the same for all k. 
It is convenient to evaluate V&(t, k) at the same time r for all modes k. To achieve this, we express 
P d (r, k) as 

Pd(r, k) = P d (r in (k), k) ( D ^ k)) ) > ( 4 -65) 

where D(t) is the growth function (see Sect. 2.1) being independent of k at first order. With D(t) oc 
t 2 during matter domination and since the condition of horizon entry is kr Ui {k) ~ k/T-L{T m {k)) = 1, 
it holds 

D(r m (k)) oc k~ 2 (4.66) 
and finally the primordial power spectrum inside the horizon at a given time is 



V d {r,k) oc k ns D(r) , n s ~l, 



(4.67) 



where n s is the spectral index. Interestingly, this form of the power spectrum was proposed even ten 
years before inflation was introduced (see Sect. 2.2.2). 

The power spectrum is sometimes expressed in dimensionless form as 

A d(^ k) = ^k 3 V d (r, k) oc P=+ 3 . (4.68) 
Thus with Eq. (4.62) the dimensionless power spectrum for $ is independent of k, i.e. 

A*(ife) oc ^fi- oc fc" 8 - 1 oc const . (4.69) 

This is why the power spectrum (4.67) for n s = 1 is called scale invariant. However, this scale 
invariance holds only approximately. In the next section we will compute the deviation from scale 
invar iance. 

4.3.2. Deviation from scale invariance 

The deviation of the power spectrum from scale invariance is produced by the amount the right hand 
side of Eq. (4.61) changes during inflation for the different modes. It is obtained by computing (see 
Eq. (4.69)) 

d^> =k d[(n s -l)lnk] =ns _^ 
dink dk 
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For our calculation, we again adopt the time coordinate which is defined with respect to the end of 
inflation (see Eq. (4.8)), so that it holds —kT out (k) = 1 at the time of horizon exit for a given mode 
k. Then we can express the derivative after Ink as 



d 



dink 



k 



d 

dk 



k 



dT OVLt d(f> ^ 
dk dr ou t d<fi 



d 



dink 



Tout (fc) 



<KTout(fc)) d_ 

k d(j) 



Tout (fe) 



(4.71) 



Eliminating % in the two Eqs. (1.82) and solving for <\> gives 

j V U 

With this relation and again with Eq. (1.82) we can express A$ as 



A* oc fc 3 ($(fc)<r(fc')} oc ^- oc V ( 



oc 



y/2 



(4.72) 



(4.73) 



The deviation from scale invariance is then given by 



n s - 1 = 



din A$ 



din A; 
( 6 



1 V d 



Tout(fe) 



V 



8vrG V 



(3 In V - 2 In V') 



Tout(fc) 



16vrG V V 



+ 



V 



(4.74) 



8vrG V" 



Tout(fc) 



Thus with the definition of the slow- roll parameters (1.83) this becomes 



"8-1= (- 6e + 2r ?)lTout(fc) 



(4.75) 



If the spectral index ra s is unequal 1, the spectrum is called "tilted", where n s > 1 is called "blue 
spectrum" and n s < 1 "red spectrum". The measured value of n s is about 0.97 (see Tab. 1.1), so the 
actual spectrum in the universe is almost scale invariant, but slightly tilted to the red side. This is 
an excellent confirmation of the phenomenology of slow roll inflation. 

Thus we conclude this chapter with the statement that as a result of the simplest models of inflation 
the DM perturbation 5d on large scales (linear regime) but well inside the horizon can be regarded as 
a realization of a homogeneous and isotropic Gaussian random field with zero mean and an almost 
scale invariant power spectrum. Additionally the perturbations are adiabatic, i.e. outside the horizon 
any energy contribution is subjected to the generalized adiabatic condition (4.33) and there are no 
entropy perturbations. 
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Appendix 

Classical scalar field theory 



The simplest kind of field in classical (and quantum) field theory is a real scalar field 4>{x). This is also 
the kind of field that is the dominant energy component in the simplest models of cosmic inflation 
(see Sect. 1.6). The theory of the scalar field is usually formulated by means of the Lagrangian 
(density) C for the field (f). In the first section of this appendix we briefly introduce the scalar field 
theory in the context of general relativity and apply it in the second section to cosmology. We adopt 
natural units, i.e. c = h = 1. 

A.1. Scalar field theory in general relativity 

The Lagrangian C<f> for a real scalar field 4> moving in a potential V{4>) and in the presence of gravity 
is given by 

= -^v^w - v(cf>) = -\d^d»<f> - v(<f>). (A.i) 

The second equality follows from the fact that applying the covariant derivative to a scalar 
field reduces to the normal derivative d^, i.e. = d^. Note that the gravitational interaction 
enters this formalism merely through the metric g^ v as always in general relativity. For g^ u — > 
r}^ = diag(— 1, 1, 1, 1) gravity is "switched off" and we are in the regime of special relativity. The 
Lagrangian for the metric field is 

£h = ^R, (A.2) 

where 1Z is the curvature for g^ u . This is the Lagrangian of the Einstein-Hilbert action. So the 
total Lagrangian becomes 

C = C n + C <t> (A.3) 

with the associated action 

„4 



S= / C^dx\ (A.4) 
Jv 

where V is a compact region with smooth boundary dV and g = det(g MJ ,). 

The variation of the action with respect to the field tp, where tp stands for either the scalar field <j) 
or the metric component g^, is defined by 



(A.5) 

e=0 
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with tp t being a 1-parameter family of fields satisfying ip e= o(x) = tp(x). Note that we can manipulate 
with S in a similar way as with normal derivatives (e.g. chain rule). Now, the Lagrangian C is defined 
such that the condition 

SS = (A.6) 

for 5ip = on the boundary dV leads to the equation of motion for ip. Thus the variation of the 
action (A. 4) with respect to <f> leads to the equation of motion for 4>, and the corresponding variation 
with respect to g^ v yields the Einstein field equations. In the following two sections we will compute 
these two variations explicitly. 



A.1.1. Variation with respect to the scalar field 

Varying the action (A. 4) with respect to <p yields 



5S 



= 5 [ C^dx i = [ (5C)^ddx 4 = [ {5C^)^dx 4 
Jv Jv Jv 



+ 



Then with 5(8^) = d^{5cp) = V„((ty) it holds 



and inserting this into Eq. (A. 7) we obtain 



V, 



5S 



V 



d4> 



-g dx 4 + 



Jv 



S(f> ) \f—g dx 4 



(A.7) 



(A.8) 



(A.9) 



By means of Gauss theorem the second integral is equivalent to a surface integral over &D, so it 
vanishes due to the boundary condition 8(j) = on dV. Since 5(f> and V are arbitrary, the condition 
5S = leads to 



v, 



-V„V</> + V = 



(A.10) 



with V' = dV/dcf). This is the Euler-Lagrange equation being the equation of motion for the 
scalar field (p. 1 



A.1.2. Variation with respect to the metric 

On the other hand, varying the action (A. 4) with respect to g^u such that 5g^ u = on dT> leads to 
the Einstein field equations and thus defines the energy-momentum tensor for the field <fi being 

5 [ C^gdx 4 = -\ [ [T^ u 5g^^gdx 4 . (A.12) 
Jd z Jv 

1 Note that in the special relativistic limit, i.e. g^ v — > jy M „, the equation of motion reduces for the potential V((j>) = 
m 2 (f) 2 /2 to the familiar Klein-Gordon equation 



{-V^d l "d v +m 2 )<j> = 4>- V 2 4> + m 2 4> = 



(A.ll) 
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To see see this, we need the identity (see e.g. Straumann 2004, Sect. 2.3) 

5 [ TZ^dx 4 = [ G^Sg^^dx 4 , (A.13) 
Jv Jv 

where G^ u is the Einstein tensor. Then using the Eqs. (A.13) and (A. 12), the variation of S becomes 



Jv 

5 / CuV~9 dx 4 + 5 / C^yJ^g dx 
Jv Jv 



= L ^G G ^^9dx 4 - \ JjWV^dx 4 



(A.14) 



Together with the condition 5S = for arbitrary 5g^ u and V we obtain the field equations 



(A.15) 



This justifies the definition of the energy-momentum tensor (A. 12). 

In order to compute the energy-momentum tensor explicitly, we need the variation of \J—g. To 
compute it, we need the auxiliary relation that any invertible, differentiable matrix M(x) satisfies 
(see e.g. Weinberg 1972, Sect. 4.7) 



tr 



M~ 



d_ 
dx 



M{x) 



d_ 

dx 



In | det M(x) | , 



so that it holds for the metric Sg/g = g^ u 5g^ u . This yields the variation of \f zr g as 



K\f ZI 9) = -\-^^>g = -^~= gg^Sg^ = g^Sg^ = g ^ g 



jJLV 



(A.16) 



(A-17) 



where in the last step we have used 

= 5 ($"„) = 8 {g^g av ) = Sg^g au + g^Sg av . (A.18) 
With the relation (A. 17), the variation of the left hand side of Eq. (A. 12) with respect to g^ v becomes 

-gdx A = [ \(5C 4> )^g + C(5^g) 
Jv L 

1 



5 f C d 
Jv 



dx 4 



V 

1 

2 .iv 



2 5g^d^d„<PV^g - \^v^~g dg» v 
- d^(j)d u 4> - Ctpg^v dg^ u yj^g dx 4 , 



dx 4 



(A.19) 



and it follows by comparison with the right hand side of Eq. (A. 12) the explicit form of the energy- 
momentum tensor: 2 



(A.22) 



2 In special relativity, the energy-momentum tensor T M „ for a field ip is usually derived by a symmetry argument 
(see e.g. Mandl & Shaw f993, Sect. 2.4). If the Lagrangian of the field £.(ip, d^if)) is invariant under spacetime 
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If (ft^(ft^ < 0, the energy-momentum tensor takes the form of an ideal fluid 

[T<j>]nv = {p<f,+P<f>) KU^^+P^V (A.23) 
with the effective energy density p^, effective pressure p^, and 4- velocity [u^] M being 



P< t> = - l -d»(ftd^ + v(cft), p^-ld^d^-v^), [u+Y 



(A.24) 



A.2. Scalar field in cosmology 

In this section we apply the results of the previous section to cosmology. First we consider a homoge- 
neous and isotropic FLRW universe and then a linearly perturbed universe as discussed in Chapter 
3. 

A.2.1. FLRW universe 

In an unperturbed FLRW universe using comoving coordinates, the scalar field can only depend on 
time, i.e. (ft(t, x) = (ft(t), due to the homogeneity of the universe. Moreover, since in a FLRW universe 
the energy-momentum tensor always takes the form of an ideal fluid, our energy-momentum tensor 
(A.23) has already the right form and we just have to evaluate the expressions (A.24) using the 
Robertson- Walker metric (1.7) and di<ft> = 0: 



P^ = \tf + V{(ft), V<t> = \tf-V{<i>)i W^ = Wm = (1,0,0,0). (A.25) 



So a scalar field has the time dependent equation of state 

Mt) _ y^m (A . 26) 

with the bounds — 1 < w^t) < 1. If the kinetic energy of the field is small compared to the potential 
V, i.e. (ft 2 <C V, it follows — —1. Thus such a fluid can mimic a cosmological constant A and 
leads, if it is the dominant energy component of the universe, to an exponential expansion of the 
universe (see Sect. 1.3.2). Both kinds of accelerations, inflation as well as the recent acceleration by 
dark energy, could in principle be caused by a scalar field. 

If the scalar field (ft does not interact with any other energy component in the universe, its equa- 
tion of motion is either given by the Euler-Lagrange equation (A. 10) or by the energy-momentum 



translations, then the quantity 

T ^ = d^P) d ^ + Cv ^ (A - 20) 

is conserved as a consequence of Noether's theorem and thus is interpreted as energy-momentum tensor. This 
criterion of translational invariance is satisfied by the Lagrangian (A.l) of the scalar field (ft and leads with the 
formula (A. 20) after generalizing to curved spacetime, i.e. ?? M „ — > g M „, to the same energy-momentum tensor as in 
Eq. (A.22): 

T M „ = — - d v <j> + Ld.g^v = g^,d~' (f>d v (ft + £<*gw = d^4>d u (j> + C&g^ . (A. 21) 

<3(<7 M <p) 

This confirms our general relativistic approach by means of an action variation. 
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conservation V \ l [T l j ) Y u = 0. Both approaches yield the same result. Since we have already computed 
V^fT^]^ = (see Eq. (1.24)), we can just insert the expressions (A. 25) into Eq. (1.24) yielding 



6 + 3Hd> + V' = . 



(A.27) 



This is the equation of motion for a scalar field in a FLRW universe. 

A.2.2. Perturbed universe 

In the following we extend our discussion by considering a perturbed universe as in Chapter 3. We 
adopt the corresponding notation and use conformal time r (see Eq. (1.8)) instead of cosmic time t. 
The perturbed scalar field is 

<f> = $ + 6<i>, (A.28) 

where 4>(r) denotes the field of the background FLRW universe and 5<p(x) is a small perturbation 
to be treated at first order. We want to express the energy-momentum perturbation of the scalar 
field 5^]^ in the form of Eq. (3.13)) in terms of the field perturbation 6<j>. With the inverse of the 
perturbed metric (3.7) which is at first order 

g 00 = -a" 2 (1 - 2A) , g i0 = g 0i = a~ 2 B i , g ij = oT 2 [ (1 - 2H L ) 6 ij - H ij ] (A.29) 
and with d^<p = (</) + d(fi, diS<f>) we have again at first order 

d^cttd^ = g^d^dvcf) = ~ (1 - 2A) (4> + 5<f>) 2 = ^ (-$ 2 -2$5<j> + 2A$ 2 ^j . (A.30) 
Thus we obtain for the first two expressions in Eq. (A. 24) 



P<t> = ~2^W + v (<i>) = ^ + V{4>) + ($6<i> - A4> 2 ) + v'l . 

1 1 ± 1 /± • ± \ - (A.31) 

P4> = - j^W " = ^ ~ V $) + ~2 " M 2 ) ~ VtfW , 

where we used the first order expansion V(<f> + 5(f)) = V(<fi) + V'(4>)5<p. Furthermore, with 

yj-d^^d^ P4>+P<t, 

it follows 

[T</>]oi = (Pcf, + P4>) [u^oKli + goiP<f> = 4> di5<p + a 2 Bip^ = <p di5(p + c^Bip^ . (A.33) 
Comparing this expression to the second equation of (3.13) yields 

(pt + Pt) {Bi + [v^i) = -^$di5<P . (A.34) 

In Fourier space, we can decompose the energy-momentum perturbations ^[T^]^^ into scalar, vector, 
and tensor modes using Eq. (3.39). Since <f> is a scalar field, 5(j) is a 3-scalar under spatial rotations 
due to the rotational invariance of the background FLRW universe. So it holds 5<f> = 6^ with the 
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result that vector or tensor modes cannot be produced by a scalar field. Using the Eqs. (A. 31) and 
(A. 34) and decomposing them according to Eq. (3.39) the scalar modes are then explicitly given by 





a z V 




<^ 2 ) + y'(0)^(°) 




a z V 




<^ 2 ) - y'(0)^(°) 


to+p*) (^ (0 M 0) ) 


1 






ni 0) 


= 0. 







Inserting these expressions into the equations of motion (3.53) yields 

continuity: 50 + 2U5^ + (fc 2 + aV) 6<j> = (i (0) - 3ij[ 0) + fefl^j - 2a 2 V'A^ , 



(A.35) 
(A.36) 
(A.37) 
(A.38) 



(A.39) 



whereas the Euler equation just reduces to Eq. (A. 27) for the unperturbed field cj). 
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